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Summary 
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Gender gaps in assessment outcomes 
in Vermont and the United States 



Using data from the National Assess- 
ment of Educational Progress (NAEP) and 
the New England Common Assessment 
Program (NECAP), the report examines 
how gender gaps differ between Ver- 
mont NAEP scores and U.S. NAEP scores 
and between Vermont NAEP and NECAP 
scores in grades 4 and 8. Overall and 
disaggregated by poverty and disability 
status, gender achievement gaps in Ver- 
mont resembled those in the country as a 
whole except in a few cases. 

Vermont education leaders remain concerned 
about the size of gender gaps in statewide 
assessment data. They question whether the 
gender gaps in Vermont differ from gaps in 
the country as a whole and whether they differ 
when measured by the state assessment (the 
NECAP) and by the NAEP in Vermont. 

Current debates over gender gaps occur within 
a context of heightened national focus on 
achievement gaps among students. The No 
Child Left Behind (NCLB) Act of 2001 has 
increased pressure on states, districts, and 
public schools to close such gaps in reading and 
math. Under the act states must work to address 
achievement gaps to ensure that all students 
reach proficiency by 2014. States are required 
to conduct annual assessments in reading and 



math in grades 3-8 and once in high school and 
to report the results by student poverty, race/ 
ethnicity, disability, and limited English pro- 
ficiency status. Interest in gender gaps is thus 
high among many education leaders as they 
tackle issues of education equity and strive to 
improve achievement outcomes for all students. 

By comparing Vermont and U.S. NAEP read- 
ing, writing, and math assessment data for 
2000-07 and by examining Vermont NECAP 
reading and math data for 2006 and writing 
data for 2002, this report addresses the follow- 
ing questions: 

• Do gender gaps differ significantly be- 
tween Vermont and U.S. NAEP scores? 

• How do gender gaps differ between Ver- 
mont’s NECAP scores and its NAEP scores? 

The study found that: 

• In reading, gender gaps in Vermont and 
U.S. NAEP scores have typically not dif- 
fered at a statistically significant level. 
From 2002 to 2007 grade 4 girls outscored 
boys by 5-8 points (or 7-10 percentiles) on 
average, and grade 8 girls outscored boys 
by 9-13 points (or 11-16 percentiles) on 
average, in both Vermont and nationwide. 
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In writing, gender gaps in 2002 Vermont 
and U.S. NAEP scores did not differ at a 
statistically significant level. The grade 4 
gender gap was 21 points (or 23 percen- 
tiles) in Vermont and 18 points (or 19 per- 
centiles) nationwide. And the grade 8 gap 
was 24 points in Vermont and 21 points 
(or 25 and 22 percentiles) nationwide. 

In math, gender gaps in Vermont and U.S. 
NAEP scores did not differ at a statistically 
significant level. Boys outscored girls by 5 
points (8 percentiles) or less on average in 
2000-07. 

In all but a few cases the gender gaps in 
reading and math scores on the Vermont 
NECAP differed from those of the Ver- 
mont NAEP by 5 percentile points or less. 
Gender gaps in 2006 NECAP scores were 
larger in reading and smaller in math 
than gender gaps in 2007 NAEP scores 
by 5 percentile points or less. Account- 
ing for disability and poverty subgroups, 
however, revealed several exceptions. The 
average NECAP and NAEP gender gaps 
in math scores differed by 12 percentile 
points among grade 8 students in poverty 



and by 7 percentile points among grade 8 
students with disabilities. 

Analysis of NAEP and NECAP data suggests 
areas for further research. For example, gender 
achievement gaps in Vermont resembled 
those in the country as a whole except in a few 
isolated cases. Future research could explore 
whether these instances represent broader 
trends or are statistical outliers. In addition, 
between grades 4 and 8 in both jurisdictions 
boys’ score advantage in math shrinks and 
girls’ score advantage in reading and writing 
grows. Future studies could examine whether 
changes in gender gaps occur between other 
grades and whether these changes reflect 
differences in development between boys and 
girls. Finally, gender gaps in all three content 
areas changed after controlling for student dis- 
ability status. Among students with disabili- 
ties in both grades and in both jurisdictions 
the gender gap was larger in math and smaller 
in reading and writing. These shifts raise ques- 
tions about how boys and girls are grouped in 
disability categories. This report outlines these 
issues and presents data to augment current 
knowledge about gender gaps in achievement 
in Vermont and the United States. 
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Using data from the 
National Assessment 
of Educational 
Progress (NAEP) and 
the New England 
Common Assessment 
Program (NECAP), the 
report examines how 
gender gaps differ 
between Vermont 
NAEP scores and 
U.S. NAEP scores and 
between Vermont 
NAEP and NECAP 
scores in grades 4 
and 8. Overall and 
disaggregated 
by poverty and 
disability status, 
gender achievement 
gaps in Vermont 
resembled those 
in the country as 
a whole except 
in a few cases. 



WHY THIS STUDY? 

Analysis of the gender gap in statewide assessment 
data is an important component of the Vermont 
Department of Education’s commitment to ad- 
dressing student achievement gaps within the 
state.' Concerned by differences in boys’ and girls’ 
scores on English language arts and math assess- 
ments in Vermont and by national media reports 
of gender disparities in schooling, the Vermont 
State Board and Commissioner of Education have 
requested an in-depth analysis of gender gaps in 
reading, writing, and math assessments among 
Vermont public school students (see box 1 for defi- 
nitions of key terms). These leaders wanted more 
information on the size of gender gaps among 
students disaggregated by poverty and disability 
status, on how gender gaps compare between 
Vermont and the country as a whole, and on how 
gender gaps compare between different assess- 
ments in Vermont — specifically, the statewide 
assessment used for meeting federal accountability 
requirements, the New England Common As- 
sessment Program (NECAP), and the nationwide 
assessment administered biannually by the federal 
government, the National Assessment of Educa- 
tional Progress (NAEP). 



Studying gender gaps 

Educators and the public have expressed concerns 
about differences in academic achievement be- 
tween boys and girls for decades, even as the focus 
of debates has shifted. In the 1990s the national 
report How schools shortchange girls (American 
Association of University Women 1992) focused 
the public spotlight on the academic plight of girls 
(Mead 2006). The report claimed that girls had 
received less attention from teachers than boys 
had and that girls had fallen short of boys in such 
areas as math and self-esteem by the end of high 
school. Girls also averaged lower scores on stan- 
dardized tests for college than boys did. And girls 
were much less likely than boys to pursue careers 
in the growing fields of science and technology, 
even when they excelled in those fields (American 
Association of University Women 1998). 
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BOX 1 

Definitions of key terms 

Effect size. A standardized measure 
of the difference between two group 
outcomes. Standardized effect sizes 
(often denoted in standard devia- 
tion units) help researchers compare 
outcomes with different units of 
measurement. For example, scores on 
the National Assessment of Edu- 
cational Progress (NAEP) and the 
New England Common Assessment 
Program (NECAP) have different 
scales. Converting achievement gaps 
on the NAEP and NECAP from their 
NAEP or NECAP scales to effect sizes 
measured in standard deviation units 
makes it easier to compare achieve- 
ment gaps from each assessment. 

Gender gap. A measure of the dif- 
ference between male and female 
outcomes. In this report a gender 
gap is calculated by subtracting the 
mean scale score of boys from the 
mean scale score of girls on a specific 
grade-level subject assessment. 

Percentile. A percentile is the value 
below which a certain percentage of 



data fall. For example, if a student 
scores at the 23rd percentile of a test 
score distribution, then 23 percent of 
other students achieved a lower score. 

Percentile difference. The percentile 
difference is the difference between 
two percentiles. For example, if boys 
score at the 48th percentile on an as- 
sessment, and girls score at the 52nd 
percentile, the percentile difference is 
equal to 4 percentile points. (Percen- 
tile differences are described in units 
of percentile points in this report.) 

Scale score. A scale score is a test 
score that has been converted from a 
raw figure to a number on a com- 
mon scale indicating a student’s 
performance level. NAEP scale scores 
range from 0 to 500 in reading and 
math and from 0 to 300 in writing 
for grades 4, 8, and 12. NECAP scale 
scores range from 400 to 480 for 
grade 4 and from 800 to 880 for grade 
8 in all content areas. 

Standard deviation. Standard devia- 
tion is a measure of how widely or 
narrowly data are dispersed around 
the data mean. For example, the 



standard deviation of a set of student 
test scores is calculated by sum- 
ming the squared deviations of each 
student’s individual score from the 
mean, dividing this sum by the total 
number of students, and taking the 
square root of the resulting figure. A 
student’s test score can be described 
in terms of standard deviation units 
by subtracting the mean from the 
student’s score and dividing that 
figure by the standard deviation. 

Standard error. Standard error is 
a measure of the amount of error 
between an estimated statistic from 
a sample and the true statistic for 
the population. For example, the 
mean test score for a sample of 
students will have a standard error 
that estimates the deviation between 
the sample mean and the mean for 
the entire student population. The 
standard error for a sample mean is 
calculated by dividing the standard 
deviation of the sample data by the 
square root of the number of subjects 
in the sample. 

For more detail on these terms, see 
appendix A. 



In recent years media attention has recognized the 
education plight of boys. Major newspapers, news 
magazines, and television news programs have fea- 
tured lead stories on boys’ academic performance — 
and whether there is a “boy crisis” in U.S. schools 
(Conlin 2003; Stahl 2003; Tyre 2006; Von Drehle 
2007). Worrisome trends include boys’ lower test 
scores in reading and writing, higher levels of be- 
havioral problems, higher placement rates in special 
education, and smaller gains in higher education 
compared with girls. Some researchers argue that 
earlier concerns about girls may have prompted 
schools to alter classroom practices in ways that 
benefit girls over boys (Conlin 2003; Stahl 2003). 



Previous findings 

Gender gaps in achievement have been docu- 
mented for decades. Nationwide, gender gaps in 
assessment outcomes have long existed in multiple 
content areas (Cole 1997; Coley 2001; Freeman 2004; 
Klecker 2006; Meadows, Land, and Lamb 2005; 
Nowell and Hedges 1998), with girls consistently 
outperforming boys in some areas, and boys out- 
performing girls in others. While some gaps have 
been persistent, others have declined over time. 

In national assessments of youth literacy skills 
girls have outscored boys since the 1960s. Studies 
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of national assessment outcomes find that the 
gender gap in writing has changed little over 
time — girls’ writing scores were approximately 
0.4 standard deviation higher than those of boys 
in both 1960 and 1990 (Cole 1997). Gender gaps 
in NAEP reading and writing scores are negligible 
when students first enter school but widen as 
students progress through grades 4-12 (Freeman 
2004). 

In contrast, boys have outperformed girls in 
math over the past several decades. National data 
suggest that boys and girls display similar math 
achievement levels when they begin school. From 
grade 3 onward boys outscore girls by a steady 
margin (Freeman 2004). Analyses of national 
assessment data show that the gender gap in math 
shrank dramatically over 1960-90, from an ef- 
fect size of 0.45 to 0.10, and has remained steady 
ever since (Cole 1997; McGraw, Fubienski, and 
Struchens 2006). 

Researchers have also identified gender gaps in 
test score variability. In most content areas boys’ 
scores tend to vary more than girls’ scores, and 
the difference in the standard deviations of boys’ 
and girls’ test scores tends to grow from grade 4 to 
grade 12 (Willingham and Cole 1997). The group 
with greater variation tends to become overrepre- 
sented at the top and bottom of the score distribu- 
tion. A study of NAFP data shows that in math 
and science boys are overrepresented in the upper 
tails of test score distributions and that in reading 
and writing girls are overrepresented at the top 
percentiles (Nowell and Hedges 1998).^ 

Studies show that gender gaps do not reflect 
achievement gaps among other student sub- 
groups— such gaps usually persist after controlling 
for background variables such as race/ethnicity 
and socioeconomic and disability status (Ameri- 
can Association of University Women 1998; Free- 
man 2004; FoGerfo, Nichols, and Chaplin 2006; 
McGraw, Fubienski, and Struchens 2006; Ready 
et al. 2005 )? Although gender gaps have typically 
been smaller than racial, socioeconomic, and 
disability gaps on a variety of assessments (U.S. 



Department of Education, National Center for 
Education Statistics 2005, 2006), they may be of 
particular concern within specific student sub- 
groups. Achievement (particularly in reading) is 
very low for boys in poverty (Mead 2006). Students 
with disabilities — a group in which boys form 
a disproportionate share — achieve much lower 
scores than their nondisabled counterparts (Mead 
2006). How gender gaps interact with socioeco- 
nomic, disability, and other gaps merits deeper 
and ongoing investigation. 



The gender gap in Vermont 



In recent years Vermont education leaders have ex- 
pressed a strong interest in learning about achieve 
ment gender gaps (Hayes 2007; Johnson 2002; 
Pandiani and Bramley 2002; Vermont Student 
Assistance Center 2005). 

Aware of existing gender 
gaps within the state 
and alarmed by national 
media reports on the un- 
derperformance of boys 
in English language arts 
and other areas nation- 
wide, the Vermont State 
Board and Commissioner 
of Education asked how 
state and national gender 
gaps compare, in the 

aggregate and disaggregated by student poverty 
and disability status. Students in poverty make 
up more than 26 percent of all students within the 
state, and students with disabilities make up 11 
percent (table Cl in appendix C). Because the state 
is accountable for the achievement of all students 
under the No Child Feft Behind (NCFB) Act of 
2001, state leaders have expressed interest in bet- 
ter understanding gender gaps within these two 
subgroups.'* 



Vermont education 
leaders have expressed 
a strong interest in 
learning how state 
and national gender 
gaps compare, in 
the aggregate and 
disaggregated by 
student poverty and 
disability status 



Because Vermont is held accountable under the 
NCFB Act for student performance levels based 
on NECAP scores, state education leaders have 
expressed interest in how the NECAP has been 
measuring student gender gaps and how these 
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gaps compare with those measured by other 
nationally recognized assessments like the NAEP, 
administered to nationally representative samples 
of students since the 1960s. ^ A recent comparison 
of the NAEP and individual state assessments con- 
cludes that state assessment standards and student 
outcomes differ greatly across the country (U.S. 
Department of Education, National Center for 
Education Statistics 2007). Neither that study, nor 
any other, however, has compared NECAP scores 
with NAEP scores. This report addresses these 
issues by examining how gender gaps compare in 
Vermont and nationally as well as on the NECAP 
and the NAEP. 



Research questions 

This report examines two questions of interest to 
Vermont state education leaders about reading, 
writing, and math achievement for students in 
grades 4 and 8: 



DO GENDER GAPS DIFFER 
SIGNIFICANTLY BETWEEN VERMONT 
AND U.S. NATIONAL ASSESSMENT OF 
EDUCATIONAL PROGRESS SCORES? 

In general, gender gaps in reading, writing, and 
math scores in Vermont and nationally do not 
differ, even after taking student poverty and dis- 
ability status into account. The following sections 
provide more detailed descriptions of gender gaps 
from NAEP scores in each jurisdiction and differ- 
ences between Vermont and national gender gaps 
for reading, writing, and math. 



Reading 

Students in grades 4 and 8 took the NAEP read- 
ing assessment in 2002, 2003, 2005, and 2007. 
This study examines gender gaps overall — and by 
poverty and disability status — for Vermont and 
nationally (table 1). 



1 . Do gender gaps differ significantly between 
Vermont’s NAEP scores and U.S. NAEP 
scores, overall and after controlling for stu- 
dent poverty and disability status? 

2. How do gender gaps differ between Vermont’s 
NECAP and Vermont’s NAEP scores, overall 
and after controlling for student poverty and 
disability status? 



In general, gender gaps 
in reading, writing, 
and math scores in 
Vermont and nationally 
do not differ, even 
after taking student 
poverty and disability 
status into account 



Researchers used the online database, NAEP Data 
Explorer (U.S. Department of Education, National 
Center for Education Statistics 2008), to retrieve 
publicly available data on NAEP scores in reading 
(2002, 2003, 2005, 2007), writing (2002), and math 
(2000, 2003, 2005, 2007) for grades 
4 and 8 in Vermont and nationally, 
and NECAP assessment results for 
2006 in grades 4 and 8, provided 
by the Vermont Department of 
Education (see box 2 and appendix 
A for details on study methods 
and limitations; see appendix B for 
descriptions of the NEAP and the 
NECAP). 



Gender gaps in Vermont and nationally. On aver- 
age, 4th and 8th grade girls consistently outscored 
boys on the 2002-07 NAEP reading assessments. 
During that period gender gaps among grade 4 
students ranged from 5 to 8 points in Vermont 
and 6 to 8 points nationally (see table 1). These 
test score gaps are statistically significant (statisti- 
cally greater than zero) for each year. Expressed 
in effect size units, average NAEP reading scores 
among grade 4 girls exceeded those of boys by 
0.17-0.26 standard deviation in Vermont and 
0.17-0.20 standard deviation nationally. And the 
average girl ranked above the average boy by 7-10 
percentiles in Vermont and 7-8 percentiles nation- 
ally (see appendix A for explanation of percentile 
difference calculations). 

Aggregate gender gaps in reading were slightly 
larger among older students. During 2002-07 
grade 8 girls outscored boys by 9-13 points in 
Vermont, and by 9-10 points nationally. In effect- 
size terms gender gaps ranged from 0.30 to 0.42 
standard deviation in Vermont and from 0.28 to 
0.30 nationally. Alternatively, the average grade 
8 girl outranked the average grade 8 boy on the 
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BOX 2 

Study methods and limitations 

To determine gender gaps in National 
Assessment of Educational Progress 
(NAEP) reading (2002, 2003, 2005, 
2007), writing (2002), and math 
(2000, 2003, 2005, 2007) assessment 
scores for students in grades 4 and 
8 in Vermont and the United States, 
researchers used the NAEP Data Ex- 
plorer online database (U.S. Depart- 
ment of Education, National Center 
for Education Statistics 2008). The da- 
tabase also provided Vermont NAEP 
reading and math results (2007), 
which were compared with NECAP 
results (2006)— provided by the 
Vermont Department of Education 
(Vermont Department of Education, 
Standards and Assessment, 2007) — to 
determine how different assessments 
measured gender gaps. Average NAEP 
and NECAP scores were examined 
by gender, poverty status (defined 
by eligibility in the National School 
Lunch Program), and disability status 
(defined by eligibility for an Individu- 
alized Education Program). 

To compare gender gaps in Vermont 
and nationwide, researchers calcu- 
lated the following outcome statistics 
from NAEP reading, writing, and 
math data: average scale scores for 
boys and girls (in the aggregate), 
gender gap estimates (measured as 
differences in average scale scores 
between girls and boys), effect sizes 
for gender gap estimates (in both 
standard deviation units and average 
percentile differences), and estimates 
of differences between Vermont and 
national gender gaps. Researchers 
disaggregated gender gaps by poverty 
and disability status and compared 



these gaps in Vermont and nation- 
wide. Gender gaps within jurisdic- 
tions and differences in gender gaps 
between jurisdictions were tested for 
statistical significance. 

To compare gender gaps measured by 
the Vermont NECAP and Vermont 
NAEP, researchers calculated the 
following outcome statistics from 
each assessment in reading and math: 
average scale scores for boys and 
girls (in the aggregate), gender gap 
estimates (measured as differences in 
average scale scores between girls and 
boys), and effect sizes for gender gap 
estimates (in both standard deviation 
units and average percentile differ- 
ences). Researchers disaggregated gen- 
der gaps by student poverty and dis- 
ability status. All cases where students 
took out-of-grade NECAP tests were 
excluded (for details on data exclusion 
see appendixes A and D). Gender gaps 
as measured by each assessment (both 
in the aggregate and by student pov- 
erty and disability status) were tested 
for statistical significance. 

Three limits of this study should be 
noted. First, because the NECAP and 
NAEP measure student achievement 
with different scoring scales, test score 
gaps were converted into standardized 
effect sizes to compare gaps. Although 
reporting confidence intervals around 
effect sizes is becoming a recom- 
mended practice (Gumming 2001; 

Nix and Barnette 1998; Steiger 2004; 
Thompson 1998), calculating the 
intervals was beyond the scope of this 
project. This report therefore does not 
indicate whether gender gaps mea- 
sured by the Vermont NECAP and 
NAEP assessments differ at statisti- 
cally significant levels. Instead, gender 



gaps from each assessment are pre- 
sented in standardized effect size units 
and the percentile equivalents to help 
readers form their own judgments. 

Second, this report does not compare 
gender gaps in writing scores from 
the Vermont NECAP and NAEP as- 
sessments. Data from each assessment 
were available for different grade-level 
cohorts only and were therefore not 
comparable. This report compares 
outcomes on the Vermont NECAP 
and NAEP reading and mathematics 
assessments only. Third, more precise 
and detailed analyses of gender gaps 
across the two jurisdictions require 
examining student-level NAEP micro- 
data, which are not available from the 
NAEP Data Explorer. Future studies 
may wish to explore these data to 
expand the analyses presented here. 

At best, this report points educa- 
tion leaders toward areas for further 
analysis and investigation. This report 
does not assign meaning to the relative 
scope of gender or other types of gaps 
in student achievement or explain the 
causes behind gender gaps in NAEP or 
NECAP data. And it does not suggest 
how to close gender gaps. Because the 
report measures statistical signifi- 
cance alone, it does not draw conclu- 
sions of the substantive and practical 
significance of results, which may vary 
by context (Light, Singer, and Willett 
1990). Having captured the interest of 
the education community, these topics 
now require ongoing study along with 
continued measurement of achieve- 
ment gaps over time. 

For additional details on study data, 
methods, and limitations, see ap- 
pendix A. 
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“Gender gap is statistically significant (different from zero) atp < 0.05. 

— is not available. 

A/ofe; Gender gap is defined as the mean scale score of girls minus the mean scale score of boys (f-M). Effect size is calculated asd = (F-M}/SDp„„,g^,whereSDp„„,g^ = ^[(SDp^ + SDnf^}/2]. Percentile difference 
measures differences between the average girl and average boy in the test score distribution. Shaded pairs of Vermont and national gender gap estimates are statistically different from each other at p < 0.05. 

Source; Authors' analysis based on grades 4 and 8 reading data for 2002, 2003, 2005, and 2007 from U.S. Department of Education, National Center for Education Statistics (2008). 
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NAEP test score distribution by 12-16 percentiles 
in Vermont and 11-12 percentiles nationally. 

When scores were disaggregated by student 
poverty and disability status, there were a few 
shifts in the differences between girls’ and boys’ 
reading scores in both jurisdictions. Gender gaps 
in reading shrank, or disappeared, for some stu- 
dent subgroups. Gender gaps among Vermont 4th 
graders in poverty were not statistically signifi- 
cant from 2002 to 2005. Among Vermont 4th and 
8th graders with disabilities gender gaps were not 
statistically significant through 2007. Displaying 
a similar pattern, national gender gaps were not 
statistically significant among 4th graders with 
disabilities in any year except 2003. Although 
national grade 8 gender gaps were statistically 
significant within all subgroups, the gaps were 
smaller among students with disabilities than 
among all 8th graders. 



Among 4th graders in poverty differences be- 
tween Vermont and national gender gaps were 
not statistically significant except in 2003, when 
the national gap exceeded the Vermont gap by 5.5 
points. This margin is statistically significant at a 
95 percent confidence level. Differences in gender 
gaps between the two jurisdictions were signifi- 
cant for 4th graders not in poverty only in 2005, 
when Vermont’s gap exceeded the national gap by 
a statistically significant margin. Because of the 
number of differences examined (39; see table A2 
in appendix A), it is possible that these two 
statistically significant findings may have arisen 
by chance. Indeed, the overall pattern of results 
shows that gender gaps in Vermont and nationally 
were very similar in grades 4 and 8 in reading, 
both before and after taking student poverty and 
disability status into account. 



Writing 



Differences between Vermont and national gender 
gaps. For all grades 4 and 8 students differences 
between Vermont and national gender gaps were 
not statistically significant. Gender gaps in read- 
ing among all Vermont 4th graders differed from 
those among all U.S. 4th graders by -2.1 to 0.8 
points from 2002 to 2007 (see table 1). Among 
grade 8 students gender gaps between Vermont 
and the country differed by -0.1 to 3.0 points. 
None of these differences was statistically sig- 
nificant at the p < 0.05 level. Thus, over 2002-07 
grades 4 and 8 girls outscored boys in reading by 
similar amounts in Vermont and nationally. 

Disaggregated by poverty and disability status, 
differences in gender gaps in each jurisdiction 
have rarely differed by statistically significant 
amounts from 2002 to 2007. For grade 8, differ- 
ences between Vermont and national gender gaps 
by student poverty subgroups are not statistically 
significant from 2002 to 2007. In both grades 
4 and 8 differences between Vermont and na- 
tional gender gaps for student subgroups with 
and without disabilities also are not statistically 
significant. 



Grades 4 and 8 students took the national NAEP 
writing assessment in 2002. This study examines 
gender gaps overall and by poverty and disability 
status (table 2). 



Gender gaps in Vermont and the United States. 
Girls outscored boys in Vermont and nationally 
on the 2002 NAEP writing assessment by greater 
margins than on the NAEP reading assessment. 
The grade 4 gender gap in writing was more than 
21 points in Vermont and 
almost 18 points nation- 
ally. Among grade 8 
students aggregate gender 
gaps were even larger — 
girls outperformed boys 
by more than 24 points in 
Vermont and almost 21 
points nationally. 



Girls outscored boys in 
Vermont and nationally 
on the 2002 NAEP 
writing assessment 
by greater margins 
than on the NAEP 
reading assessment 



In effect sizes average grade 4 gender gaps were 
0.61 standard deviation in Vermont and 0.50 stan- 
dard deviation nationally; average grade 8 gender 
gaps were larger, at 0.66 standard deviation in 
Vermont and 0.58 standard deviation nationally. 
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TABLE 2 

Grades 4 and 8 gender gaps and associated effect sizes in National Assessment of Educational Progress 
writing scores, by student poverty and disability status, Vermont and nationally, 2002 



Students 


Grade 4 


Grade 8 | 


Vermont 


National 


Vermont 


National I 


All 


Gender gap (points) 


21.3** 


17.5** 


24.2** 


20.9** 


Effect size (d) 


0.61 


0.50 


0.66 


0.58 


Percentile difference 


23 


19 


25 


22 


Students in poverty 










Gender gap (points) 


24.1** 


16.5** 


26.1** 


19.4** 


Effect size (d) 


0.72 


0.49 


0.70 


0.56 


Percentile difference 


26 


19 


26 


21 


Students not in poverty 


Gender gap (points) 


22.6** 


18.9** 


24.1** 


22.2** 


Effect size (d) 


0.67 


0.57 


0.70 


0.66 


Percentile difference 


25 


22 


26 


25 


Students with disabilities 


Gender gap (points) 


6.3 


9.6** 


7.4 


13.7** 


Effect size (d) 


0.21 


0.28 


0.22 


0.39 


Percentile difference 


8 


11 


9 


15 


Students without disabilities 


Gender gap (points) 


20.0** 


16.5** 


23.3** 


18.8** 


Effect size (d) 


0.61 


0.49 


0.68 


0.55 


Percentile difference 


23 


19 


25 


21 



**Gender gap is statistically significant (different from zero) at p < 0.05. 

Nofe.- Gender gap is defined as the mean scale score of girls minus the mean scale score of boys (F-M). Effect size is calculated as d = {F - M) / 

where SDp„oied = VKSD,^ + SDf/) / 2]. Percentile difference measures differences between the average girl and average boy in the test score distribution. The 

shaded pair of Vermont and national gender gap estimates is statistically different from each other at p < 0.05. 

Source: Authors' analysis based on grades 4 and 8 reading data for 2002 from U.S. Department of Education, National Center for Education Statistics (2008). 



In 2002 the average grade 4 girl outranked the 
average grade 4 boy by 23 percentiles in Vermont 
and 19 percentiles nationally, while the average 
grade 8 girl scored 25 percentiles higher than her 
male counterpart in Vermont and 22 percentiles 
higher nationally. 

Disaggregated by student poverty status gender 
gaps in writing changed very little by grade and 
jurisdiction. A different pattern emerged after 
disaggregating by disability status. Similar to the 
results for reading, gender gaps in writing in both 
Vermont and nationally were smaller for students 
with disabilities than for students in the aggregate. 



In Vermont the gender gaps for grades 4 and 8 
students with disabilities (6.3 and 7.4 points) in 
2002 were not statistically significant. Nationally, 
girls with disabilities outscored boys with dis- 
abilities by statistically significant margins in both 
grades, but the gender gap was about half the size 
of the aggregate gender gap among 4th graders 
and two -thirds the aggregate gender gap among 
8th graders. 

Differences between Vermont and national gender 
gaps. Aggregate gender gaps on the 2002 NAEP 
writing assessment were similar in Vermont and 
nationally. The gender gap in Vermont exceeded 
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the gap nationally by 3.8 points among 4th grad- 
ers and by 3.3 points among 8th graders. Neither 
margin was statistically significant. 

In general, Vermont and national gender gaps in 
writing were similar after controlling for student 
poverty and disability status. Among students 
not in poverty differences between Vermont and 
national gender gaps (3.7 points in grade 4 and 1.9 
points in grade 8) were not statistically significant. 
And within the subgroups of students with and 
without disabilities Vermont and national gender 
gaps never differed by statistically significant 
amounts. Of 10 total differences examined in writ- 
ing, there was only one exception. Among grade 
4 students in poverty the gender gap in Vermont 
exceeded the national gap by 7.6 points — a statisti- 
cally significant amount. 



Math 



Grades 4 and 8 students took the national NAEP 
math assessment in 2000, 2003, 2005, and 2007. 
This study examines gender gaps in math scores 
among all students and students in poverty and 
disability subgroups (table 3). 

Gender gaps in Vermont and nationally. Boys 
typically outperformed girls on the NAEP math 
assessment in both Vermont and nationally from 
2000 to 2007. The aggregate gender gap among 
4th graders ranged from 2 to 5 points (0.06 to 0.19 
standard deviation) in Vermont and from 2 to 3 
points (or 0.08 to 0.09 standard deviation) nation- 
ally. The average grade 4 boy outranked the aver- 
age grade 4 girl in math scores by 2-8 percentiles 
in Vermont and 3-4 percentiles nationally. 

In both jurisdictions grade 8 boys in the aggregate 
also outscored girls in math. But the gender gaps 
were smaller for 8th graders than for 4th graders, 
an opposite pattern to that found for reading and 
writing. 

In both grades and jurisdictions gender gaps in 
math change very little after students are disag- 
gregated by poverty status but show strong shifts 



after students are disag- 
gregated by disability 
status. Eor example, 

Vermont boys with dis- 
abilities outscored their 
female counterparts in 
math by 8-12 points in 
grade 4 and 10-15 points 
in grade 8 over the seven-year period. Expressed 
differently, the average Vermont boy with dis- 
abilities outranked his female counterpart by 
11-17 percentiles in grade 4 and 11-18 percentiles 
in grade 8. National gender gaps among students 
with disabilities were also larger than those for all 
U.S. students, particularly in grade 8 (see table 3). 

Differences between Vermont and national gender 
gaps. In general, aggregate gender gaps in math 
have not differed significantly across the two 
jurisdictions since 2000. An exception occurred 
in 2005, when grade 4 boys outscored girls by 
5.1 points in Vermont and 2.4 points nationally. 
Vermont’s gap exceeded the national gap by nearly 
3 points — a statistically significant amount. In all 
other years the differences between Vermont and 
national aggregate gender gaps were not statisti- 
cally significant in grades 4 or 8. 

After controlling for student poverty and disability 
status, gender gaps in math in Vermont and na- 
tionally remain similar. From 2000 to 2007 differ- 
ences between Vermont and national gender gaps 
ranged from 0 to 5 points in grade 4 and from 0 to 
7 points in grade 8 across all poverty and disabil- 
ity status subgroups (see table 3 and tables E8 and 
ElO in appendix E). Only one of the differences 
examined was statistically significant. In 2005 
the gender gap in math among grade 4 students 
without disabilities was significantly larger in 
Vermont than nationally. This result reflected the 
statistically significant finding (noted above) for 
all 4th graders in 2005, because in both jurisdic- 
tions populations of students without disabilities 
are relatively similar in size to the total student 
populations. Thus, of 38 differences between ju- 
risdictions examined in math, 2 were statistically 
significant (see table A2). 



Boys typically 
outperformed girls 
on the NAEP math 
assessment in both 
Vermont and nationally 
from 2000 to 2007 




TABLE 3 

Grades 4 and 8 gender gaps and associated effect sizes in National Assessment of Educational Progress (NAEP) math scores by poverty and 
disability status, Vermont and nationally, 2000, 2003, 2005, and 2007 
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**Gender gap is statistically significant (different from zero) atp < 0,05. 

— is not available. 

Note; Gender gap is defined as the mean scale score of girls minus the mean scale score of boys (f-M). Effect size is calculated asd = {F-M) / SOpooij.^, where SDpo„|j,d = V[(SDp^ + SDj,^)/2], Percentile difference 
measures differences between the average girl and average boy in the test score distribution. Shaded pairs of Vermont and national gender gap estimates are statistically different from each other atp <0.05. 

Source; Authors' analysis based on NAEP math data for grades 4 and 8 for 2000, 2003, 2005, and 2007 from U.S. Department of Education, National Center for Education Statistics (2008). 
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HOW DO GENDER GAPS DIFFER BETWEEN 
VERMONT'S NEW ENGLAND COMMON 
ASSESSMENT PROGRAM AND VERMONT'S 
NATIONAL ASSESSMENT OF EDUCATIONAL 
PROGRESS SCORES? 

In all but a few cases gender gaps in reading and 
math on the 2006 Vermont NECAP differed from 
those on the 2007 Vermont NAEP by 5 percen- 
tile points or less. There were two instances in 
both reading and math when gender gaps on the 
Vermont NECAP and NAEP differed by more than 
5 percentile points. In addition, girls’ advantage in 
reading scores was greater and boys’ advantage in 
math scores was smaller on the NECAP than on 
the NAEP in all but one case. 



Reading 

Two observations emerged from comparing 
gender gaps in the 2006 NECAP and 2007 NAEP 
reading scores. First, aggregate gender gaps on the 
2006 NECAP exceeded those on the 2007 NAEP 
by less than 5 percentile points. Gender gap ef- 
fect sizes in grade 4 reading scores were 0.32 on 
the 2006 Vermont NECAP and 0.22 on the 2007 
Vermont NAEP (figure 1). These statistics indicate 
that the average grade 4 girl outranked the average 
grade 4 boy in reading by 13 percentiles on the 

2006 NECAP and 9 percentiles on the 2007 NAEP. 
The average grade 8 girl outranked her male coun- 
terpart by 16 percentiles on the 2006 NECAP (0.40 
standard deviation) and 14 percentiles on the 2007 
NAEP (0.35 standard deviation; figure 2). 

Thus, gender gaps in reading were smaller on the 

2007 NAEP than on the 2006 NECAP by 4 percen- 
tile points for the grade 4 cohort and by 2 per- 
centile points for the grade 8 cohort. The reasons 
for the differences in gap sizes are not clear — the 
differences may have arisen from dissimilar 
NECAP and NAEP population samples, different 
test instruments, or a genuine shift in the gender 
gap over the school year. 

Second, after controlling for student poverty and 
disability status, Vermont gender gaps in reading 



FIGURE 1 

Grade 4 gender gap effect sizes and percentile 
differences for all students and by student 
poverty and disability status for the Vermont 
2006 New England Common Assessment Program 
and 2007 National Assessment of Educational 
Progress reading scores 

Effect size (d) 

■ 2006 New England Common Assessment Program 

■ 2007 National Assessment of Education Progress 

0.4 0.34 




All Students Students Students Students 
students in poverty not in with without 

poverty disabilities disabilities 



Nofe; Values in parentheses are percentile differences. Positive effect 
sizes indicate girls outperforming boys; negative effect sizes indicate 
boys outperforming girls. 

Source: Authors' analysis based on NAEP data from the NAEP Data 
Explorer (U.S. Department of Education, National Center for Education 
Statistics 2008) and authors' tabulations of student-level NECAP data 
(Vermont Department of Education, Standards and Assessment 2007). 

on the 2006 NECAP exceeded those on the 2007 
NAEP by 5 percentile points or less. Gender gaps 
in reading continued to be similar on the two 
assessments after grades 4 and 8 students were 
disaggregated into poverty status subgroups (see 
figures 1 and 2). For example, the average grade 
4 girl in poverty outperformed the average grade 
4 boy in poverty by 13 percentiles on the 2006 
NECAP and 8 percentiles on the 2007 NAEP. And 
the average grade 8 girl in poverty outperformed 
her male counterpart by 15 percentiles on the 2006 
NECAP and 13 percentiles on the 2007 NAEP. In 
both grades for students not in poverty the gender 
gaps measured by each assessment differed by 5 or 
fewer percentile points. 

A similar story emerged after grades 4 and 8 stu- 
dents were disaggregated by disability status. For 
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FIGURE 2 



Grade 8 gender gap effect sizes and percentile 
differences for all students and by student 
poverty and disability status for the Vermont 
2006 New England Common Assessment Program 
and 2007 National Assessment of Educational 
Progress student reading scores 
Effect size {d) 




All 

students 



Students 
in poverty 



Students 
not in 
poverty 



Students 

with 

disabilities 



Students 

without 

disabilities 



Wofe; Values in parentheses are percentile differences. Positive effect 
sizes indicate girls outperforming boys; negative effect sizes indicate 
boys outperforming girls. 

Source: Authors' analysis based on NAEP data from the NAEP Data 
Explorer (U.S. Department of Education, National Center for Education 
Statistics 2008) and authors' tabulations of student-level NECAP data 
(Vermont Department of Education, Standards and Assessment 2007). 



example, the average grade 4 girl with disabilities 
ranked below her male counterpart by 5 percen- 
tiles on the 2006 NECAP and 6 percentiles on the 
2007 NAEP (see figure 1). Among grade 8 students 
with disabilities the average girl ranked above the 
average boy by 2 percentiles on the 2006 NECAP 
and below the average boy by 3 percentiles on the 
2007 NAEP (see figure 2). In both these cases and 
those for students without disabilities gender gaps 
on the two assessments differed by no more than 5 
percentile points. 



5 or fewer percentile points on the 2006 NECAP 
and 2007 NAEP. Grade 4 girls performed better 
than grade 4 boys in math by 0.02 standard de- 
viation on the 2006 NECAP and worse than their 
male counterparts by 0.10 standard deviation on 
the 2007 NAEP (figure 3). Thus, the average grade 
4 girl in Vermont ranked 1 percentile higher than 
her male counterpart on the 2006 NECAP and 
4 percentiles lower than her male counterpart 
on the 2007 NAEP. Within the grade 8 cohort 
the average girl outscored the average boy by 3 
percentiles on the 2006 NECAP and 2 percentiles 
on the 2007 NAEP (figure 4). For both grade-level 
cohorts gender gaps differed between the two 
assessments by 5 or fewer percentile points. As 
with reading scores, girls’ performance relative 
to boys’ was stronger on the NECAP than on the 
NAEP. 



FIGURES 

Grade 4 gender gap effect sizes and percentile 
differences for all students and by student 
poverty and disability status for Vermont 2006 
New England Common Assessment Program 
and 2007 National Assessment of Educational 
Progress math scores 



Effect size (d) 

0-2 ■ 2006 New England Common Assessment Program 

■ 2007 National Assessment of Education Progress 




- 0.19 

(- 8 ) 



students in poverty 



Students 

with 



Students 

without 



poverty disabilities disabilities 



Math 

Two observations emerged from comparisons of 
gender gaps on the 2006 NECAP and 2007 NAEP 
math assessments in Vermont. First, aggregate 
gender gaps in Vermont math scores differed by 



Nofe; Values in parentheses are percentile differences. Positive effect 
sizes indicate girls outperforming boys; negative effect sizes indicate 
boys outperforming girls. 

Source: Authors' analysis based on NAEP data from the NAEP Data 
Explorer (U.S. Department of Education, National Center for Education 
Statistics 2008) and authors' tabulations of student-level NECAP data 
(Vermont Department of Education, Standards and Assessment 2007). 
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FIGURE 4 

Grade 8 gender gap effect sizes and percentile 
differences for all students and by student 
poverty and disability status for Vermont 2006 
New England Common Assessment Program 
and 2007 National Assessment of Educational 
Progress math scores 

Effect size {d) 

0.2 B 2006 New England Common Assessment Program 

■ 2007 National Assessment of Education Progress 

0.11 0.10 




- 0.30 

(- 12 ) 



All Students Students Students Students 

students in poverty notin with without 

poverty disabilities disabilities 

Wofe; Values in parentheses are percentile differences. Positive effect 
sizes indicate girls outperforming boys; negative effect sizes indicate 
boys outperforming girls. 

Source: Authors' analysis based on NAEP data from the NAEP Data 
Explorer (U.S. Department of Education, National Center for Education 
Statistics 2008) and authors' tabulations of student-level NECAP data 
(Vermont Department of Education, Standards and Assessment 2007). 

Second, after controlling for poverty and disability 
status, Vermont gender gaps differed by 5 or fewer 
percentile points for 4th graders, but by more than 5 
percentile points among subgroups of 8th graders on 
the 2006 NECAP and 2007 NECAP math assess- 
ments. Among grade 4 students in poverty there 
was no difference in average math scores for boys 
and girls on the 2006 NECAP, while the average girl 
scored 4 percentiles below the average boy on the 
2007 NAEP (see figure 3). Among grade 4 students 
who were not in poverty the average girl ranked 
below the average boy in math by 1 percentile on the 
2006 NECAP and 4 percentiles on the 2007 NAEP. 

In both cases the size of the gender gap on each as- 
sessment differed by 5 percentile points or less. 

A different pattern emerged for 8th graders after 
math scores were disaggregated by student poverty 



status. Among grade 8 students who were not in 
poverty, the average girl outscored the average 
boy by 4 percentiles on the 2006 NECAP and 1 
percentile on the 2007 NAEP, for a difference in 
gender gaps of 3 percentile points (see figure 4). In 
contrast, the average grade 8 girl in poverty scored 
above her male counterpart by 4 percentiles on the 
2006 NECAP and below her male counterpart by 
8 percentiles on the 2007 NAEP, for a difference 
in gender gaps of 12 percentile points — a differ- 
ence between assessments four times larger for 8th 
graders in poverty than for those not in poverty. 

With math scores disaggregated by student dis- 
ability status, students in grades 4 and 8 again 
displayed contrasting gender gaps on the two 
assessments. Gender gaps in math on the 2006 
NECAP and 2007 NAEP differed by 5 or fewer 
percentiles for grade 4 students with or without 
disabilities (see figure 3 and table F3 in appendix 
E). Among grade 4 students with disabilities the 
underperformance of girls compared with boys 
was greater on the 2006 NECAP than on the 2007 
NAEP — the only instance when the gender gaps 
on the two assessments compared in this way. In 
contrast, gender gaps on the two assessments di- 
verged by larger margins among grade 8 students 
after controlling for disability status. Among grade 
8 students with disabilities the average girl scored 
below the average boy by 5 percentiles on the 2006 
NECAP and 12 percentiles on the 2007 NAEP — 
a 7 percentile point difference between the two 
assessments. 



FINAL OBSERVATIONS AND 
AREAS FOR FUTURE RESEARCH 

Consistent with previous research this study of 
publicly available NAEP data shows that over 
2000-07 girls in Vermont and the country as 
a whole consistently outscored boys in read- 
ing and writing, while boys typically outscored 
girls in math. This examination of gender gaps 
in Vermont and U.S. NAEP scores and in recent 
Vermont NECAP scores suggests areas for further 
investigation. 
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The first research question asks whether gen- 
der gaps on the NAEP have varied significantly 
between Vermont and the country as a whole. The 
difierences between Vermont and national gender 
gaps were statistically significant in only a few 
instances: 



• In reading scores (in which girls consistently 
outscored boys), the gender gap was signifi- 
cantly smaller in Vermont than nationally in 
2003 for grade 4 students in poverty and sig- 
nificantly larger in Vermont than nationally 
in 2005 for grade 4 students not in poverty 
(see table 1). 



• In writing scores (in which girls consistently 
outscored boys), the gender gap in 2002 for 
grade 4 students in poverty was significantly 
larger in Vermont than nationally (see table 2). 

• In math scores (in which boys typically 
outscored girls), the gender gaps in 2005 for 
grade 4 students in the aggregate and without 
disabilities were significantly larger in Ver- 
mont than nationally (see table 3). 



Vermont and national 
gender gaps differed 
significantly in only 5 
percent of all differences 
examined for reading 
and math scores 
and 10 percent of all 
differences examined 
for writing scores 



Vermont and national gender gaps differed signifi- 
cantly in only 5 percent of all differences exam- 
ined for reading and math scores and 10 percent 
of all differences examined for writing scores 
(see table A2 in appendix A). Of 87 total differ- 
ences tested across all three content areas, 5 cases 
emerged as statistically significant. Because of the 
total number of comparisons made, it is possible 
that some of these cases arose by chance. For 
instance, when results are reported at a 95 percent 
confidence level, approximately 5 
percent will be statistically signifi- 
cant due simply to chance. Thus, 
it is difficult to tell whether these 
cases reflect underlying trends or 
statistical anomalies. 



Because gender gaps in reading 
and math were not consistently 
larger in one jurisdiction than 
in the other, the isolated cases 



in which differences were statistically significant 
would not appear to signal discernable trends. By 
contrast, Vermont gender gaps in writing as- 
sessment scores appear to exceed national gaps 
consistently among both grades 4 and 8 students 
in the aggregate and after controlling for poverty 
status. Only one comparison yielded a statistically 
significant result, however, and the data examined 
were for one year. Future research could examine 
whether Vermont and national gender gap differ- 
ences in reading, writing, and math in particular 
extend into subsequent years. 

In both Vermont and nationally gender gaps in 
writing have been larger than those in reading and 
math. Consistent with long-term trends, Vermont 
and national gender gaps on the NAFP writing as- 
sessment were larger than gender gaps in reading 
and much larger (and in the opposite direction) 
than those in math (figure 5). Future research 
could examine these gaps in greater depth due to 
their scale and recorded persistence over time. 
Such research could explore whether gender gaps 

FIGURES 

National Assessment of Educational Progress 

gender gaps in grades 4 and 8, by content area 

and grade (points) 



Difference in average scale scores (F-M) 




Grade 4 Grades 

Wofe; Writing and reading data are from 2002; math data are from 2003. 

Source: Authors' analysis based on NAEP data from the NAEP Data 
Explorer (U.S. Department of Education, National Center for Education 
Statistics 2008). 
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vary by other individual and social factors beyond 
poverty and disability status (such as race/ethnic- 
ity, age, and parent or teacher education levels). 
Studies could also prohe whether gender gaps 
shift under different testing conditions — such as 
writing by hand or computer, or writing fiction or 
nonfiction. Further insight could also be gained 
by examining current gender gaps in writing at 
different points in the overall score distribution — 
such as at the mean and the upper and lower ex- 
tremes. Such efforts would explore how differences 
between girls’ and boys’ scores vary by writing 
ability level, both in Vermont and nationally. 

In both jurisdictions gender gaps in reading 
and writing tend to be larger in grade 8 than in 
grade 4. This pattern is consistent with findings 
from previous studies of NAEP scores (Freeman 
2004). In the aggregate the reverse pattern appears 
to hold in math. From 2003 to 2007 gender gaps in 
both jurisdictions were slightly smaller in grade 8 
than in grade 4. Both these patterns may reflect 
the same phenomenon — as students advance in 
grade level, scores may increase more for girls 
than for boys. A larger dataset — representing 
more grades over a longer time — could test this 
hypothesis, both in the aggregate and controlling 
for student background variables. If girls’ scores 
increase more than boys’ scores for at least some 
student groups as they progress through school, it 
is unclear why. Might some girls develop academi- 
cally at a faster rate than boys do? Or might some 
girls learn to become better test-takers faster than 
boys do? If either of these propositions is true, do 
they hold equally for high-scoring and low-scoring 
students? All these questions might merit further 
investigation. 

Gender gaps were smaller in reading and writing, 
and larger in math, after controlling for disability 
status. In both jurisdictions girls achieved higher 
average scale scores than did boys on the NAEP 
reading and writing assessments. After controlling 
for disability status, however, gender gaps in read- 
ing were smaller by as much as 100 percent for stu- 
dents with disabilities and 25 percent for students 
without disabilities. In writing gender gaps were 



as much as 100 percent 
smaller for students with 
disabilities and 10 percent 
smaller for students with- 
out disabilities. In math 
gender gaps were as much 
as 400 percent larger for 
students with disabilities 
and 60 percent larger 
for students without 
disabilities. 



Because gender gaps 
in all three content 
areas shift markedly 
after students are 
disaggregated by 
disability status, the 
question emerges 
whether boys and 
girls are being evenly 
distributed into disability 
status categories: further 
research is necessary 
to better understand 
these gender gap shifts 



Because gender gaps in 
all three content areas 
shift markedly after stu- 
dents are disaggregated 

by disability status, the question emerges whether 
boys and girls are being evenly distributed into 
disability status categories. Girls achieve higher 
average reading scores than boys in the total 
Vermont and national populations; this gender gap 
can shrink for students with disabilities if boys 
with Individualized Education Programs tend 
to have relatively high reading scores, while girls 
with Individualized Education Programs tend 
to have very low reading scores. Different score 
distributions among boys and girls with Individu- 
alized Education Programs can also underlie the 
larger gender gaps found in math after disaggre- 
gating by disability status. Further research into 
score distributions by gender and disability status 
is necessary to better understand these gender gap 
shifts. 



With only a few exceptions gender gaps in 
Vermont on the 2006 NECAP have consistently 
differed from gender gaps on the 2007 NAEP by 
5 percentile points or fewer in reading and math. 
This result was remarkably consistent except in 
two cases for grade 8 math scores. Among 8th 
graders in poverty boys outperformed girls in 
math on the 2007 NAEP by 8 percentiles, and 
girls outperformed boys on the 2006 NECAP 
by 4 percentiles, for a 12 percentile point dif- 
ference between gender gaps. Girls performed 
better relative to boys on the NECAP than on the 
NAEP. Among 8th graders with disabilities boys 
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outperformed girls by 12 percentiles on the 2007 
NAEP and 5 percentiles on the 2006 NECAP, for 
a 7 percentile point difference. And girls again 
performed better relative to boys on the NECAP 
than on the NAEP. 

These cases underscore another consistent find- 
ing: in all but one case the degree to which girls 
outperformed boys in reading and neared boys’ 
performance in math was greater on the 2006 
NECAP than on the 2007 NAEP. The one excep- 
tion was in math among grade 4 students with 
disabilities; boys outscored girls by 11 percentiles 



on the 2007 NAEP and 16 percentiles on the 2006 
NECAP. Future research might examine why 
gender gaps on the NECAP appear to favor girls — 
especially among grade 8 students in poverty or 
with disabilities. 

Finally, observations from both NECAP and NAEP 
data show that poverty and disability gaps in test 
scores are much larger than gender gaps. More 
detailed examination of poverty and disability 
gaps, both before and after controlling for other 
background student characteristics, might be 
interesting avenues for further research. 



APPENDIX A 
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APPENDIX A 

DATA SOURCES, RESEARCH 

METHODS, AND LIMITATIONS 

This section describes the study’s data sources, 
research methods, and limitations. National As- 
sessment of Educational Progress (NAEP) data were 
retrieved from the NAEP Data Explorer database 
(U.S. Department of Education, National Center 
for Education Statistics 2008) and New England 
Common Assessment Program (NECAP) data were 
provided by the Vermont Department of Education 
(Vermont Department of Education, Standards and 
Assessment 2007). Researchers examined the data 
to measure gender gaps, differences between gender 
gaps, and the statistical significance of gender gaps 
and differences between gender gaps. Analysis of 
data was limited by the constraints of publicly avail- 
able data, the difficulties of comparing data from 
different assessments, and the focus of the study. 



Data sources 

Researchers used the NAEP Data Explorer online 
database to determine gender gaps in publicly 
available NAEP reading (2002, 2003, 2005, 2007), 
writing (2002), and math (2000, 2003, 2005, 2007) 
assessment scores for 4th and 8th graders in Ver- 
mont and nationally. The NAEP Data Explorer also 
provided Vermont NAEP reading and math results 
(2007), which researchers compared with NECAP 
results (2006) (Vermont Department of Education, 
Standards and Assessment 2007) to determine 
how different assessments measured gender gaps. 

Average scale NAEP and NECAP scores were 
examined by gender both before and after disag- 
gregating by poverty status (defined by eligibil- 
ity in the National School Lunch Program) and 
disability status (defined by eligibility for an 
Individualized Education Program). At every 
grade level NAEP scale scores range from 0 to 500 
in reading and math and from 0 to 300 in writing. 
The NECAP is scored with a range of 80 points for 
each grade. In grade 4 scores range from 400 to 
480, in grade 5 from 500 to 580, in grade 8 from 
800 to 880, and so on. 



Occasionally, a student in one grade will be admin- 
istered the NECAP test for a lower level grade, re- 
sulting in a score that was scaled for a lower grade. 
All these cases — a small proportion of the total — 
were excluded from the analysis. Exclusion rates 
differed very little across student subgroups (for 
details on the size of the original NECAP data sets 
and exclusion rates, see table D1 in appendix D). 

Comparison years for the New England Common 
Assessment Program and National Assessment of 
Educational Progress in reading and math. This 
project had access to student-level data from the 
Vermont NECAP reading and math assessments 
administered in the fall (October) of 2005 and 
2006. Aggregate data for the most recent NAEP 
reading and math assessments, available through 
the NAEP Data Explorer, were for tests adminis- 
tered in the winter and spring (January through 
March) of 2005 and 2007. 

This report compares gender gaps in reading and 
math from the NECAP administered in fall 2006 
and the NAEP administered in early 2007. Stu- 
dents in grades 4 and 8 who took assessments in 
these years were from the same grade-level cohorts 
(table Al, cohorts A and E). Although this proj- 
ect had access to the 2005 Vermont NECAP and 
NAEP scores, comparing these two assessments 
was problematic and was therefore not conducted. 

Specifically, a comparison of grade 4 scores on 
the 2005 NAEP with grade 4 scores on the 2005 
NECAP would have compared outcomes from 
different student cohorts (see table Al). Differences 
in gender gaps from these comparisons could be 
due to variation between student cohorts as well as 
the two assessments. Alternatively, grade 4 scores 
on the 2005 NAEP and grade 5 scores on the 2005 
NECAP are from the same student cohort, but dif- 
ferences between these two sets of scores could be 
due not only to variation in the two assessments 
but to the change in student grade levels (see 
cohort C in table Al). 

To be clear, differences in gender gaps found from 
NAEP and NECAP tests administered to students 
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TABLE A1 

Vermont reading and math assessments 
for the National Assessment of Educational 
Progress (NAEP) and the New England Common 
Assessment Program (NECAP) by student cohort 
and grade levels, 2004-07 



Student 

cohort 


2004/05 


2005/06 


2006/07 


2005 

NAEP" 


2005 

NECAPS 


2006 

NECAP'’ 


2007 

NAEP" 


Cohort A 




Grade 3 


Grade 4 


Grade 4 


Cohort B 




Grade 4 


Grade 5 




Cohort C 


Grade 4 


Grade 5 


Grade 6 




Cohort D 




Grade 6 


Grade 7 




Cohort E 




Grade 7 


Grade 8 


Grade 8 



Note: Shaded gender gap data were examined and compared in this 
report. The NAEP was administered from January through March in 2005 
and 2007 and was not administered in 2006. The NAEP is administered 
only to students in grades 4, 8, and 12. The NECAP was administered in 
October of 2005 and 2006 to students in grades 3 through 8. 

Source: U.S. Department of Education, National Center for Education 
Statistics (2008) and Vermont Department of Education, Standards and 
Assessment (2007). 

from the same grade level and cohort cannot be 
attributed entirely to differences between the 
two tests. Other factors associated with each 
assessment could contribute to different student 
outcomes. Because student cohorts and grade 
levels are likely to be related to student outcomes, 
however, comparisons of NAEP and NECAP data 
without controlling for these factors were avoided. 

Comparisons between New England Common As- 
sessment Program and National Assessment of Edu- 
cational Progress writing scores. The NAEP writing 
assessment was last administered to students in 
grades 4 and 8 in Vermont and across the country 
in 2002. Meanwhile, the NECAP writing assess- 
ment has been administered to students in grades 
5 and 8 in Vermont since 2005. Comparisons of 
writing scores from the 2002 NAEP and 2005 or 
2006 NECAP would have involved comparisons 
between different tests, student cohorts, and grade 
levels. Thus, gender gaps in writing measured by 
the two different assessments were not compared 
for this report. Instead, Vermont gender gaps in 
writing were calculated from the 2006 NECAP and 
are presented separately in table F2 in appendix F. 



Research methods 

Comparing gender gaps in Vermont New England 
Common Assessment Program and U.S. National 
Assessment of Educational Progress scores. Two 
sets of differences were calculated and tested for 
statistical significance: differences between girls’ 
and boys’ NAEP scores (gender gaps) within each 
jurisdiction and differences between Vermont and 
national gender gaps. 

Differences between girls’ and boys’ National As- 
sessment of Educational Progress scores (gender 
gaps) within each jurisdiction. Gender gaps were 
calculated as the mean scale score of girls minus 
the mean scale score of boys (P - M) in each 
subject. T-tests were conducted using the NAEP 
Data Explorer significance-testing tool to deter- 
mine whether gender gaps (among all students 
and within poverty and disability subgroups) in 
each jurisdiction were significantly different from 
zero at thep < 0.05 level. In total, 177 gender gaps 
(79 in reading, 20 in writing, and 78 in math) were 
tested for statistical significance in the two juris- 
dictions (table A2). 

To provide readers with a sense of the scale of 
calculated gender gaps, differences in average scale 
scores between girls and boys were presented two 
additional ways: 

• Effect sizes. Gender gaps in each jurisdic- 

tion, subject, and student subgroup were also 
calculated as effect sizes to provide a stan- 
dardized measure of the difference between 
girls’ and boys’ mean scale scores. Specifically, 
gender gap estimates (measured in NAEP 
scale score units) were transformed into effect 
sizes (measured in standard deviation units) 
using Cohen’s d. Effect size was calculated as 
\d\ = {Mj - M 2 ) ! Opooied_ where Mj is the first 
mean, M 2 is the second mean, and is the 
pooled standard deviation for the distribution 
around each mean, or apg„i^^ = -e 02 ^) / 2]. 
The “effect” of being a girl within the NAEP 
and NECAP was calculated by taking the 
difference between average girls’ and boys’ 




APPENDIX A 



19 



TABLE A2 

Total differences examined and number of statistically significant differences in grades 4 and 8 for Vermont 
and U.S. National Assessment of Educational Progress reading, writing, and math scores, 2000-07 







Number 






Percentage 




Differences 


Reading 

(2002-07) 


Writing 

(2002) 


Math 

(2000-07) 


Reading 

(2002-07) 


Writing 

(2002) 


Math 

(2000-07) 


Gender gaps (differences between mean girls' and boys' scores) 


Total gender gaps examined 


79 


20 


78 


100.0 


100.0 


100.0 


Statistically significant gender gaps 


66 


18 


51 


83.5 


90.0 


65.4 


Vermont 


All students 


8 


2 


3 


10.1 


10.0 


3.8 


Students by poverty status 


13 


4 


4 


16.5 


20.0 


5.1 


Students by disability status 


8 


2 


10 


10.1 


10.0 


12.8 


National 


All students 


8 


2 


5 


10.1 


10.0 


6.4 


Students by poverty status 


16 


4 


13 


20.3 


20.0 


16.7 


Students by disability status 


13 


4 


16 


16.5 


20.0 


20.5 


Differences between Vermont and national gender gaps 


Total differences between gaps 














examined 


39 


10 


38 


100.0 


100.0 


100.0 


Differences between statistically 














significant gaps 


2 


1 


2 


5.1 


10.0 


5.3 


All students 


0 


0 


1 


0.0 


0.0 


2.6 


Students by poverty status 


2 


1 


0 


5.1 


10.0 


0.0 


Students by disability status 


0 


0 


1 


0.0 


0.0 


2.6 



Note: Statistical significance (significantly different from zero) is defined as p < 0.05. 

Source: Authors' analysis based on NAEP data from NAEP Data Explorer (U.S. Department of Education, National Center for Education Statistics 2008). 



mean scale scores within each assessment and 
dividing this difference {F - M, the test score 
gender gap) by the pooled standard deviation 
for boys’ and girls’ scores. The same meth- 
odology was used to compare the effect of 
being a student not in poverty with a student 
in poverty (NP - P), and a student without 
disabilities with a student with disabilities 
(ND - D). 

• Percentile differences. To aid in the interpreta- 
tion of effect sizes, Cohen’s d statistics were 
transformed into Cohen’s statistics, and 
calculations were performed to translate 
the gender gap into a percentile difference 
between the average girl and the average boy 
in the test score distribution. 



First, Cohen’s Uj statistics were determined 
by using a z-score table to find the area 
under the standard normal curve below 
the value of each gender gap effect size. For 
example, the statistic for a gender gap of d 
- 0.22 is 0.59. If, on average, girls outscored 
boys by 0.22 standard deviation on a spe- 
cific assessment, the average girl would have 
ranked at the 59th percentile of the boys’ 
score distribution. 

Second, percentile differences between girls’ 
and boys’ scores were calculated as the per- 
centile associated with the statistic, minus 
50. Following the example above, the average 
boy would have achieved a score at the 50th 
percentile, assuming scores were normally 
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distributed. Consequently, the average girl 
would have achieved a score placing her 9 
percentiles higher than the average boy. 

This approach assumes that scores are 
normally distributed and that the standard 
deviations of scores for both groups are simi- 
lar. Inspections of student-level NECAP score 
distributions showed reading, writing, and 
math scores to be normally distributed for the 
grade levels examined in this report. Standard 
deviations for boys’ and girls’ scores were 
generally quite similar on both the NECAP 
and NAEP (see appendixes E and F). 

NAEP score distributions can be determined 
by examining the raw data (Mislevy, Johnson, 
and Muraki 1992). NAEP score distributions 
are not available, however, through the NAEP 
Data Explorer. Because the project had access 
to NAEP data only through the NAEP Data 
Explorer, this report assumes that boys’ and 
girls’ NAEP scores are normally distributed 
when calculating t/j from the available NAEP 
data. Ultimately, the calculations performed 
and the distributional assumptions made do 
not allow this report to provide precise deter- 
minations of percentile differences between 
boys and girls; instead, alternative measures 
of gender gaps are provided to give readers an 
approximate gauge for the size and import of 
calculated gender gap effect sizes. 

Differences between Vermont and national gender 
gaps. The difference between each pair of Ver- 
mont and national gender gaps was calculated 
as the Vermont gap minus the national gap 
{VTgap - U.S.gap). A series of calculations were 
then performed to determine whether differences 
between Vermont and national gender gaps were 
statistically significant. 

First, the standard error for each gender gap 
estimate was calculated as 

SEp_f^~ ^{SEp^ + SEjff), where SEp and SE,^ were 
the standard errors for girls’ and hoys’ mean scale 



score estimates (downloaded from the NAEP Data 
Explorer database). 

Second, the standard error for the difference 
between each pair of Vermont and national 
gender gaps {SEvp^^p _ u.s.gap) was derived using 
the standard errors calculated for the individual 
Vermont and national gender gaps. The formula 

used was SEyj^^p _ u ^ gap ^(.^^VTgap ^^U.S.gap )> 

where SEypgap- SEp _ Min Vermont, and 
SEu.S.gap - SEp _Min the country. 

Third, t-scores were calculated for the differ- 
ences between Vermont and U.S. gender gaps to 
determine whether these differences were statis- 
tically significant. These scores were calculated 
hrgap- U.S.gap - (VTgap - U.S.gap) / SEyp^^p - U.S.gap. 
Because the Vermont and U.S. NAEP samples were 
large and viewed as independent, the difference 
between Vermont and national gender gaps was 
considered statistically significant if the absolute 
value of typgap - U.S.gap was greater than 1.96.*’ This 
figure is the minimum standardized score that al- 
lows one to reject, at a 95 percent level of certainty, 
the null hypothesis that means (and differences 
between means) from two large and independent 
samples are equal. 

In total, 87 differences in gender gaps (39 in read- 
ing, 10 in writing, and 38 in math) were tested for 
statistical significance between the two jurisdic- 
tions (see table A2). Differences between Vermont 
and national gender gaps were described in both 
scale score and percentile points. 

Comparing gender gaps in Vermont New England 
Common Assessment Program and Vermont Na- 
tional Assessment of Educational Progress scores. 
Gender gaps in reading and math scores were 
calculated from the 2006 Vermont NECAP and 
compared with gender gaps in reading and math 
scores from the 2007 Vermont NAEP. Students 
who took the 2006 NECAP and 2007 NAEP read- 
ing and math tests were from the same grade-level 
cohorts (see table Al, cohorts A and E). Writing 
scores from the Vermont NAEP and NECAP were 
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not compared because data from common grade- 
level cohorts were not available. 

The same statistics derived from the NAEP data — 
average scale scores, standard deviations, and stan- 
dard errors — by gender (both in the aggregate and 
within student poverty and disability subgroups) 
were calculated on the 2006 Vermont NECAP data 
in reading and math. Researchers conducted t-tests 
to determine whether differences in the average 
scale scores of girls and boys were statistically sig- 
nificant atp < 0.05. Researchers did not measure the 
statistical significance of differences between gen- 
der gaps on the NAEP and NECAP. Instead, stan- 
dardized effect sizes (Cohen’s d) and their percentile 
equivalents were calculated for all gap estimates to 
compare gender gaps on the two assessments. 



Limitations of the study 

Constraints associated with publicly available 
NAEP data, the difficulties of comparing outcomes 
from different assessments, and the bounded 
nature of this empirical project placed limitations 
on the NAEP and NECAP analyses and the conclu- 
sions to be drawn. 

This report does not indicate whether gender gaps 
measured by the Vermont NECAP and NAEP 
assessments differ from each other at statistically 
significant levels. The NECAP and NAEP measure 
student achievement with different scales; test 
score gaps were converted into standardized effect 



sizes to facilitate comparisons between the two 
assessments. Although the reporting of confi- 
dence intervals around effect sizes is becoming a 
recommended practice (Cumming 2001; Nix and 
Barnette 1998; Steiger 2004; Thompson 1998), 
calculating these intervals was beyond the scope 
of this project. Instead, gender gaps from each as- 
sessment are presented in standardized effect size 
units and their percentile equivalents to help read- 
ers form their own judgments about the extent and 
importance of gender gap differences. 

Direct contrasts between Vermont NECAP and 
NAEP writing scores were not presented because 
available data from each assessment were drawn 
from different grade-level cohorts and were 
therefore not comparable. This report therefore re- 
sponds to the project’s second research question by 
comparing outcomes from the Vermont NECAP 
and NAEP assessments in reading and math only. 

Einally, results are presented in terms of statistical, 
rather than substantive or practical, significance. 
In statistical tests two estimates are assumed to be 
equivalent and are considered different at statisti- 
cally significant levels if the size of their difference 
could occur by chance with a probability of less 
than 5 percent {p < 0.05). Statistical significance 
is different from substantive or practical signifi- 
cance, which may vary by context (Light, Singer, 
and Willett 1990). This report does not make any 
claims about the substantive or practical signifi- 
cance of results. 
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APPENDIX B 

THE VERMONT NATIONAL ASSESSMENT OF 
EDUCATIONAL PROGRESS AND THE NEW 
ENGLAND COMMON ASSESSMENT PROGRAM 

This appendix describes the National Assessment 
of Educational Progress (NAEP) and the New Eng- 
land Common Assessment Program (NECAP). 



The National Assessment of Educational Progress 

The NAEP, also known as “The Nation’s Report 
Card,” consists of several ongoing assessment 
programs that are administered by the National 
Center for Education Statistics. The main national 
NAEP program has been testing students across 
the country in reading, writing, math, science, 
and four other content areas since 1969. The state 
NAEP program, which uses the same assessments 
as the national program, has been conducted on 
a voluntary basis by many states and jurisdic- 
tions since 1990. Under the No Child Left Behind 
(NCLB) Act, all states receiving Title I funds 
are now mandated to participate in the national 
assessment. The same is true for school districts 
receiving Title I funds. Participation in the NAEP 
is voluntary, however, for schools and students. 

Through the NAEP program student achievement 
has been measured in grades 4 and 8 at the state 
level and in grade 12 at the national level. Reading 
and math assessments were administered every 
two to four years from 1990 to 2002, and the writ- 
ing assessment was administered in grades 4 and 
8 in 2002. Since 2003 reading and math assess- 
ments have occurred every other year. Vermont 
has participated in the state NAEP program since 
the mid-1990s.^ Because the NAEP contains both 
state- and national-level data, NAEP data can be 
used to compare assessment results among differ- 
ent states and the country as a whole. 

The NAEP does not test all students. Instead, 
NAEP samples a representative portion of popula- 
tions within individual states and nationwide. 

The U.S. NAEP sample combines subsets of the 
students who take the individual state NAEP with 



a sample of students in nonparticipating states 
to create a nationally representative sample. The 
NAEP oversamples underrepresented popula- 
tions (such as minorities and students from rural 
areas) and uses weighting procedures to generate 
samples that are representative of states and the 
country. 

Each student who participates in the NAEP 
receives a portion of all the questions on the 
assessment (roughly 25 percent of the total). 

NAEP scores for individual students are therefore 
estimated by generating and then tallying plau- 
sible values for each test item, based on the overall 
performance of students with similar background 
characteristics and cognitive performance levels. 

Because students’ NAEP scores are estimates 
based on plausible values, it is not possible to 
know how any one student performed on an entire 
NAEP. The design of the test makes it possible to 
determine overall score estimates for groups of 
students but not for individual students. In ad- 
dition, it is not possible to determine how many 
students took a specific assessment. Although the 
NAEP Data Explorer provides standard deviation 
and standard error estimates for student subgroup 
scores, student sample sizes calculated from these 
estimates are difficult to interpret because of the 
NAEP’s sample weighting procedures. For more 
information on the NAEP, see http://nces.ed.gov/ 
nationsreportcard/about/. 



The New England Common Assessment Program 

The NECAP is administered to all public school 
students in Vermont in grades 3-8. NECAP as- 
sessments were first used in Vermont in 2005. 
Developers of the NECAP (described as “The 
New England Common Test Program” in techni- 
cal documentation) explain the purpose of the 
program as follows: “The New England Common 
Test Program (NECAP) is the result of collabora- 
tion among New Hampshire (NH), Rhode Island 
(RI), and Vermont (VT) to build a set of tests for 
grades 3 through 8 to meet the requirements of the 
No Child Left Behind Act (NCLB). The purposes 
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of the tests are as follows: (1) Provide data on stu- 
dents’ achievement in reading/language arts and 
mathematics to meet the requirements of NCLB; 
(2) provide information to support program evalu- 
ation and improvement; and (3) provide to parents 
and the public information on the performance of 



students and schools. The tests are constructed to 
meet rigorous technical criteria, include universal 
design elements and accommodations so that stu- 
dents can access test content, and gather reliable 
student demographic information for accurate 
reporting.” (Measured Progress 2007, p. 5). 
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APPENDIX C 

STUDENT DEMOGRAPHICS FOR VERMONT 
AND THE COUNTRY, 2005/06 



TABLE Cl 

Characteristics of student population in Vermont and the country, 2005/06 





Number 


Percent 


Characteristic 


Vermont 


National 


Vermont 


National® 


Total 


96,638 


963,009 






Gender 


Male 


49,861 


489,697 


51.6 


50.9 


Female 


45,961 


462,948 


47.6 


48.1 


Race/ethnicity 


Asian 


1,496 


43,957 


1.5 


4.6 


Black, non-Hispanic 


1,424 


164,252 


1.5 


17.1 


White, non-Hispanic 


91,528 


544,233 


94.7 


56.5 


Other 


417 


11,660 


0.4 


1.2 


Hispanic 


957 


189,047 


1.0 


19.6 


Grade 


PreK 


4,061 


20,323 


4.2 


2.1 


Kindergarten 


6,069 


70,969 


6.3 


7.4 


1 


6,441 


72,369 


6.7 


7.5 


2 


6,429 


70,713 


6.7 


7.3 


3 


6,486 


70,315 


6.7 


7.3 


4 


6,549 


70,147 


6.8 


7.3 


5 


6,827 


71,232 


7.1 


7.4 


6 


7,075 


71,963 


7.3 


7.5 


7 


7,166 


74,062 


7.4 


7.7 


8 


7,559 


74,548 


7.8 


7.7 


9 


8,327 


84,067 


8.6 


8.7 


10 


8,142 


75,811 


8.4 


7.9 


11 


7,888 


67,735 


8.2 


7.0 


12 


7,499 


62,359 


7.8 


6.5 


Ungraded 


120 


6,390 


0.1 


0.7 


Poverty status 


Eligible for free lunch 


18,820 


310,723 


19.5 


32.3 


Eligible for reduced-price lunch 


6,667 


70,825 


6.9 


7.4 


English language learner status 


Limited English proficiency/ 










English language learner 


1,775 


82,806 


1.8 


8.6 


Disability status 


With Individualized Education Program 


10,915 


130,940 


11.3 


13.6 



a. National average is calculated as the total divided by the 50 states and the District of Columbia. 
Source: U.S. Department of Education, National Center for Education Statistics (2006a, b,c). 
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APPENDIX D 

NEW ENGLAND COMMON ASSESSMENT 
PROGRAM CASES USED IN ANALYSES 



TABLE D1 

Total number of available cases in Vermont's New England Common Assessment Program (NECAP) dataset 
and number of cases used in analyses, 2006 



Content area 




Grades 4 and 5 






Grade 8 






Used 






Used 




Total 


Number 


Percent 


Total 


Number 


Percent 


Reading 


Male students 


3,351 


3,338 


99.6 


3,761 


3,739 


99.4 


Female students 


3,124 


3,119 


99.8 


3,474 


3,462 


99.7 


Students in poverty 


2,013 


2,004 


99.6 


1,943 


1,922 


98.9 


Students not in poverty 


4,468 


4,459 


99.8 


5,292 


5,279 


99.8 


Students with disabilities 


748 


734 


98.1 


1,034 


1,005 


97.2 


Students without disabilities 


5,733 


5,729 


99.9 


6,201 


6,196 


99.9 


Math 


Male students 


3,360 


3,349 


99.7 


3,751 


3,733 


99.5 


Female students 


3,125 


3,119 


99.8 


3,476 


3,465 


99.7 


Students in poverty 


2,017 


2,011 


99.7 


1,941 


1,923 


99.1 


Students not in poverty 


4,474 


4,463 


99.8 


5,286 


5,275 


99.8 


Students with disabilities 


751 


736 


98.0 


1,022 


998 


97.7 


Students without disabilities 


5,740 


5,738 


100.0 


6,205 


6,200 


99.9 


Writing 


Male students 


3,324 


3,324 


100.0 


3,719 


3,719 


100.0 


Female students 


3,159 


3,159 


100.0 


3,454 


3,453 


100.0 


Students in poverty 


2,031 


2,031 


100.0 


1,909 


1,908 


100.0 


Students not in poverty 


4,452 


4,452 


100.0 


5,264 


5,264 


100.0 


Students with disabilities 


782 


782 


100.0 


994 


993 


99.9 


Students without disabilities 


5,701 


5,701 


100.0 


6,179 


6,179 


100.0 



Note: The NECAP reading and math assessments were administered to students in grades 4 and 8 in 2006. The NECAP writing assessment was administered 
to students in grades 5 and 8 that same year. 

Source; Authors' tabulations of student-level NECAP data (Vermont Department of Education, Standards and Assessment 2007) 
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APPENDIX E 

VERMONT AND U.S. NATIONAL ASSESSMENT OF 
EDUCATIONAL PROGRESS SCALE SCORES AND GAPS 
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estimates were calculated as = V(Sff +Sf« ). Effect size is calculated as |d| = (F-M)/ where SDpppip^ = ^[(SDp+SDf/) / 2], U 3 is the proportion of scores in the group 

lower mean that fall below the noted effect size in a standard normal distribution. See appendix A for details. 

Source; Authors' analysis based on 2002, 2003, 2005, and 2007 NAEP reading data for grade 4 from U.S. Department of Education, National Center for Education Statistics (2008). 
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TABLE E2 

Differences in Vermont and U.S. National Assessment of Educational Progress gender gaps in grade 4 
reading scores by student poverty and disability status, 2002, 2003, 2005, and 2007 







2002 






2003 






2005 






2007 




Students 


Vermont 


National 


Difference 
in gaps 


Vermont 


National 


Difference 
in gaps 


Vermont 


National 


Difference 
in gaps 


Vermont 


National 


Difference 
in gaps 


Total 


Gender gap 
(points) 


8.4 


6.5 


1.9 


5.4 


7.5 


-2.1 


7.1 


6.3 


0.8 


7.6 


6.7 


0.9 


Standard 

error 


2.05 


0.70 


2.16 


1.66 


0.45 


1.72 


1.65 


0.37 


1.69 


1.61 


0.41 


1.66 


Students in poverty 


Gender gap 
(points) 


5.8 


7.0 


-1.2 


2.7 


8.2 


-5.5** 


0.5 


6.6 


-6.1 


6.2 


7.4 


-1.2 


Standard 

error 


3.68 


1.07 


3.83 


1.70 


0.58 


1.80 


3.25 


0.45 


3.28 


2.98 


0.50 


3.02 


Students not in poverty 


Gender gap 
(points) 


8.9 


7.0 


1.9 


7.4 


7.5 


-0.1 


9.2 


6.5 


2.7** 


8.2 


6.3 


1.9 


Standard 

error 


2.42 


0.69 


2.52 


1.67 


0.55 


1.76 


1.31 


0.40 


1.37 


1.77 


0.41 


1.82 


Students with disabilities 


Gender gap 
(points) 





2.8 





-4.2 


4.5 


-8.7 


-5.5 


1.7 


-7.2 


-5.7 


0.7 


-6.4 


Standard 

error 





1.45 





5.35 


1.28 


5.50 


5.06 


0.98 


5.15 


5.23 


1.07 


5.34 


Students without disabilities 


Gender gap 
(points) 


7.0 


5.3 


1.7 


4.3 


5.8 


-1.5 


5.4 


4.9 


0.5 


4.6 


5.3 


-0.7 


Standard 

error 


1.99 


0.76 


2.13 


1.74 


0.48 


1.81 


1.63 


0.38 


1.68 


1.53 


0.40 


1.58 



**The difference in the Vermont and national gender gap estimates is statistically significant (different from zero) at p < 0.05. 

— indicates that reporting standards for the NAEP were not met. 

Wofe; Gender gaps are defined as the mean scale score of girls minus the mean scale score of boys (F-M). Standard errors for the difference in Vermont and 
national gender gap estimates were calculated as SE^Tgap-u.s.gap = + SFu.s.gop^)- 

Source; Authors' analysis based on 2002, 2003, 2005, and 2007 NAEP reading data for grade 4 from U.S. Department of Education, National Center for Educa- 
tion Statistics (2008). 
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TABLE E4 

Differences in Vermont and U.S. National Assessment of Educational Progress gender gaps in grade 8 
reading scores by student poverty and disability status, 2002, 2003, 2005, and 2007 







2002 






2003 






2005 






2007 










Difference 






Difference 






Difference 






Difference 


Students 


Vermont 


National 


in gaps 


Vermont 


National 


in gaps 


Vermont 


National 


in gaps 


Vermont 


National 


in gaps 


Total 


9.3 


9.4 


-0.1 


11.4 


10.5 


0.9 


13.4 


10.4 


3.0 


10.6 


10.1 


0.5 


1.71 


0.73 


1.86 


1.67 


0.39 


1.71 


1.64 


0.31 


1.67 


1.76 


0.36 


1.80 


Students in poverty 


Gender gap 
(points) 


9.7 


8.5 


1.2 


9.7 


10.7 


-1.0 


10.6 


10.1 


0.5 


11.0 


10.0 


1.0 


Standard 


























error 


3.58 


0.88 


3.69 


2.81 


0.62 


2.88 


3.01 


0.47 


3.05 


2.85 


0.50 


2.90 


Students not in poverty 


Gender gap 
(points) 


8.8 


10.2 


-1.4 


12.3 


10.8 


1.5 


14.1 


11.1 


3.0 


9.9 


10.3 


-0.4 


Standard 


























error 


1.86 


0.80 


2.03 


1.83 


0.47 


1.88 


1.72 


0.33 


1.75 


1.96 


0.42 


2.01 


Students with disabilities 


Gender gap 
(points) 


4.3 


4.8 


-0.5 


1.6 


5.2 


-3.6 


3.5 


5.3 


-1.8 


-2.4 


3.8 


-6.2 


Standard 


























error 


4.41 


1.93 


4.81 


4.50 


1.26 


4.68 


4.15 


1.01 


4.27 


4.86 


1.01 


4.96 


Students without disabilities 


Gender gap 
(points) 


7.4 


7.8 


-0.3 


9.8 


8.4 


1.4 


10.1 


8.7 


1.4 


9.0 


8.4 


0.6 


Standard 


























error 


1.65 


0.72 


1.80 


1.67 


0.40 


1.72 


1.54 


0.30 


1.57 


1.70 


0.36 


1.74 



Wofe; Gender gaps are defined as the mean scale score of girls minus the mean scale score of boys (F-M). Standard errors for the difference in Vermont and 
national gender gap estimates were calculated x SEyjg^p_u^ = 4[5F,jjgcip + g^p). See appendix A for details. 

Source; Authors' analysis based on 2002, 2003, 2005, and 2007 NAEP reading data for grade 8 from U.S. Department of Education, National Center for Educa- 
tion Statistics (2008). 
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TABLE E5 

Grades 4 and 8 National Assessment of Educational Progress average scaled writing scores by gender and 
student poverty and disability status in Vermont and the country, 2002 





Grade 4 






Grade 8 




Vermont 




National 


Vermont 




National 



Students 


Mean 


Standard 

error 


Mean 


Standard 

error 


Mean 


Standard 

error 


Mean 


Standard 

error 


Total 




147.3 


1.60 


144.1 


0.58 


151.2 


1.84 


141.3 


0.68 




(35.2) 


1.16 


(35.0) 


0.51 


(36.8) 


1.11 


(36.7) 


0.38 




168.6 


1.77 


161.6 


0.43 


175.4 


1.34 


162.2 


0.63 




(35.0) 


1.18 


(35.3) 


0.32 


(36.4) 


0.88 


(35.8) 


0.28 


Gender gap (points) 


21.3** 


2.39** 


17.5** 


0.72** 


24.2** 


2.28** 


20.9** 


0.93** 


Effect size (d) 


0.61 




0.50 




0.66 




0.58 




Us 


.73 




.69 




.75 




.72 




Students in poverty 




129.5 


2.20 


132.4 


1.00 


130.9 


3.01 


126.4 


0.61 




(34.1) 


2.60 


(33.2) 


0.88 


(37.8) 


1.77 


(35.0) 


0.38 




153.6 


2.81 


148.9 


0.67 


157 


3.34 


145.8 


0.71 




(32.6) 


1.71 


(34.0) 


0.42 


(37.0) 


1.86 


(34.) 


0.37 


Gender gap (points) 


24.1** 


3.57** 


16.5** 


1.20** 


26.1** 


4.50** 


19.4** 


0.94** 


Effect size (d) 


0.72 




0.49 




0.70 




0.56 




Us 


.76 




.69 




.76 




.71 




Students not in poverty 




152.2 


1.95 


153.7 


0.56 


156.6 


2.05 


150.1 


0.74 




(33.6) 


1.30 


(33.2) 


0.43 


(34.4) 


1.46 


(34.5) 


0.32 




174.8 


2.19 


172.6 


0.51 


180.7 


1.38 


172.3 


0.79 




(33.81) 


1.47 


(32.7) 


0.36 


(34.4) 


0.90 


(32.3) 


0.30 


Gender gap (points) 


22.6** 


2.93** 


18.9** 


0.76** 


24.1** 


2.47** 


22.2** 


1.08** 


Effect size (d) 


0.67 




0.57 




0.70 




0.66 




Us 


.75 




.72 




.76 




.75 




Students with disabilities 




116.1 


3.43 


117.5 


0.86 


124.2 


3.06 


106.8 


1.03 


Mai© stud6nts — 


(31.3) 


2.95 


(33.4) 


0.61 


(32.8) 


1.77 


(34.1) 


0.68 




122.4 


6.49 


127.1 


1.29 


131.6 


3.36 


120.5 


1.68 


Female students — 


(29.8) 


3.49 


(35.2) 


1.11 


(35.7) 


2.55 


(35.6) 


0.85 


Gender gap (points) 


6.3 


7.34** 


9.6** 


1.55** 


7.4 


4.54** 


13.7** 


1.97** 


Effect size (d) 


0.21 




0.28 




0.22 




0.39 




Us 


.58 




.61 




.59 




.65 





(CONTINUED) 
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TABLE E5 (CONTINUED) 

Grades 4 and 8 National Assessment of Educational Progress average scaled writing scores by gender and 
student poverty and disability status in Vermont and the country, 2002 





Grade 4 




Grade 8 






Vermont 


National 


Vermont 


National 


Students 


Standard Standard 

Mean error Mean error 


Standard 

Mean error 


Standard 

Mean error 



Students without disabilities 


Male students 


152.4 


1.59 


147.6 


0.66 


156.8 


1.99 


146.7 


0.72 


(33.1) 


1.13 


(33.6) 


0.62 


(35.0) 


1.54 


(34.1) 


0.36 


Female students 


172.4 


1.70 


164.1 


0.42 


180.1 


1.47 


165.5 


0.62 


(32.6) 


0.96 


(34.0) 


0.32 


(33.2) 


1.05 


(33.7) 


0.31 


Gender gap (points) 


20.0** 


2.33** 


16.5** 


0.78** 


23.3** 


2.47** 


18.8** 


0.95** 


Effect size (d) 


0.61 




0.49 




0.68 




0.55 




Us 


.73 




.69 




.75 




.71 





**Gender gap is statistically significant (different from zero) at p < 0.05. 

Note: Numbers in parentheses are standard deviations. Gender gaps are defined as the mean scale score of girls minus the mean scale score of boys 
(F-M). Standard errors for the gender gap estimates were calculated as SE^_^ = + SE,/). Effect size is calculated as |c/| = {F - M) / where 

5Dpoo/,d = V[(SDp^ + SD^^)/2], U3 is the proportion of scores in the group with the lower mean that fall below the noted effect size in a standard normal 
distribution. See appendix A for details. 

Source: Authors' analysis based on 2002 NAEP writing data for grades 4 and 8 from U.S. Department of Education, National Center for Education Statistics (2008). 
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TABLE E6 

Differences in Vermont and U.S. National Assessment of Educational Progress gender gaps in grades 4 and 8 
writing scores by student poverty and disability status, 2002 







Grade 4 






Grade 8 




Students 


Vermont 


National 


Difference 
in gaps 


Vermont 


National 


Difference 
in gaps 


Total 


Gender gap (points) 


21.3 


17.5 


3.8 


24.2 


20.9 


3.3 


Standard error 


2.39 


0.72 


2.49 


2.28 


0.93 


2.46 


Students in poverty 


Gender gap (points) 


24.1 


16.5 


7.6** 


26.1 


19.4 


6.7 


Standard error 


3.57 


1.20 


3.77 


4.50 


0.94 


4.59 


Students not in poverty 


Gender gap (points) 


22.6 


18.9 


3.7 


24.1 


22.2 


1.9 


Standard error 


2.93 


0.76 


3.03 


2.47 


1.08 


2.70 


Students with disabilities 


Gender gap 


6.3 


9.6 


-3.3 


7.4 


13.7 


-6.3 


Standard error 


7.34 


1.55 


7.50 


4.54 


1.97 


4.95 


Students without disabilities 


Gender gap (points) 


20.0 


16.5 


3.5 


23.3 


18.8 


4.5 


Standard error 


2.33 


0.78 


2.46 


2.47 


0.95 


2.65 



**The difference in the Vermont and national gender gap estimates is statistically significant (different from zero) at p < 0.05. 

Note; Standard errors for the difference in Vermont and national gender gap estimates were calculated as S£i/,gop-u.s. gap = V(Sfi,7-gop^ + SEu 5 Gender 
gaps are defined as the mean scale score of girls minus the mean scale score of boys [F- M). See appendix A for details. 

Source: Authors' analysis based on 2002 NAEP writing data for grades 4 and 8 from U.S. Department of Education, National Center for Education Statistics (2008). 
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TABLE E7 (CONTINUED) 

Grade 4 National Assessment of Educational Progress average scaled math scores by gender and student poverty and disability status in 
Vermont and the country, 2000, 2003, 2005, and 2007 
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Source; Authors' analysis based on NAEP math data for grade 4 in 2000, 2003, 2005, and 2007 from U.S. Department of Education, National Center for Education Statistics (2008). 
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TABLE E8 

Differences in Vermont and U.S. National Assessment of Educational Progress gender gaps in grade 4 math 
scores by student poverty and disability status, 2000, 2003, 2005, and 2007 







2000 






2003 






2005 






2007 




Students 


Vermont 


National 


Difference 
in gaps 


Vermont 


National 


Difference 
in gaps 


Vermont 


National 


Difference 
in gaps 


Vermont 


National 


Difference 
in gaps 


Total 


Gender gap 
(points) 


-1.7 


-2.5 


0.8 


-3.5 


-2.7 


-0.8 


-5.1 


-2.4 


- 2 . 7 ** 


-2.7 


-2.3 


-0.4 


Standard 

error 


2.55 


1.51 


2.96 


1.36 


0.36 


1.41 


1.05 


0.25 


1.08 


0.98 


0.28 


1.02 


Students in poverty 


Gender gap 
(points) 


-3.4 


-0.7 


-2.7 


-5.3 


-2.2 


-3.1 


-4.7 


-1.7 


-3.0 


-3.0 


-1.3 


-1.7 


Standard 

error 


3.86 


1.61 


4.18 


2.20 


0.45 


2.25 


2.10 


0.33 


2.13 


1.85 


0.35 


1.88 


Students not in poverty 


Gender gap 
(points) 


-1.2 


-4.2 


3.0 


-2.3 


-3.0 


0.7 


-4.3 


-2.6 


-1.7 


-2.5 


-2.6 


0.1 


Standard 

error 


2.99 


1.89 


3.54 


1.38 


0.43 


1.44 


1.23 


0.30 


1.27 


1.22 


0.35 


1.28 


Students with disabilities 


Gender gap 
(points) 





-11.1 





-11.5 


-6.6 


-4.9 


-10.0 


-6.9 


-3.1 


-8.3 


-6.3 


-2.0 


Standard 

error 





4.22 





3.52 


0.76 


3.60 


3.59 


0.75 


3.66 


3.09 


0.70 


3.17 


Students without disabilities 


Gender gap 
(points) 


-1.8 


-3.7 


1.9 


-4.9 


-4.1 


-0.8 


-6.9 


-3.6 


-3.3** 


-4.5 


-3.4 


-1.1 


Standard 

error 


2.72 


1.46 


3.09 


1.34 


0.37 


1.39 


1.13 


0.27 


1.16 


1.08 


0.30 


1.12 



**The difference in the Vermont and national gender gap estimates is statistically significant (different from zero) at p < 0.05. 

— indicates that reporting standards for the NAEP were not met. 

Wofe; Gender gaps are defined as the mean scale score of girls minus the mean scale score of boys (F-M). Standard errors for the gender gap estimates were 
calculated as SE^ = V(S£f^ + SE^^). See appendix A for details. 

Source: Authors' analysis based on NAEP math data for grade 4 in 2000, 2003, 2005, and 2007 from U.S. Department of Education, National Center for Educa- 
tion Statistics (2008). 
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GENDER GAPS IN ASSESSMENT OUTCOMES IN VERMONT AND THE UNITED STATES 



TABLE E10 

Differences in Vermont and U.S. National Assessment of Educational Progress gender gaps in grade 8 math 
scores by student poverty and disability status, 2000, 2003, 2005, and 2007 







2000 






2003 






2005 






2007 




Students 


Vermont 


National 


Difference 
in gaps 


Vermont 


National 


Difference 
in gaps 


Vermont 


National 


Difference 
in gaps 


Vermont 


National 


Difference 
in gaps 


Total 


Gender gap 
(points) 


2.7 


-1.6 


4.3 


-0.1 


-1.8 


1.7 


0.1 


-1.4 


1.5 


-1.8 


-1.9 


0.1 


Standard 

error 


2.51 


1.44 


2.90 


1.39 


0.46 


1.47 


1.46 


0.33 


1.50 


1.48 


0.41 


1.54 


Students in poverty 


Gender gap 
(points) 


-0.1 


0.3 


-0.4 


-3.2 


-1.6 


-1.6 


-0.1 


-0.9 


0.8 


-5.9 


-1.2 


-4.7 


Standard 

error 


5.03 


1.82 


5.35 


2.87 


0.62 


2.94 


2.85 


0.40 


2.88 


2.78 


0.56 


2.84 


Students not in poverty 


Gender gap 
(points) 


3.4 


-0.8 


4.2 


0.8 


-1.2 


2.0 


-0.5 


-1.5 


1.0 


0.5 


-2.1 


2.6 


Standard 

error 


2.37 


1.75 


2.94 


1.48 


0.49 


1.56 


1.73 


0.37 


1.77 


1.79 


0.48 


1.85 


Students with disabilities 


Gender gap 
(points) 





-11.1 





-14.5 


-7.9 


-6.6 


-10.0 


-8.4 


-1.6 


-9.9 


-7.8 


-2.1 


Standard 

error 





4.04 





3.71 


1.16 


3.89 


4.15 


0.89 


4.24 


4.35 


1.26 


4.53 


Students without disabilities 


Gender gap 
(points) 


-0.8 


-3.3 


2.5 


-1.7 


-4.2 


2.5 


-2.7 


-3.4 


0.7 


-4.4 


-3.9 


-0.5 


Standard 

error 


2.29 


1.43 


2.70 


1.44 


0.43 


1.50 


1.53 


0.33 


1.56 


1.44 


0.40 


1.49 



— indicates that reporting standards for the NAEP were not met. 

Wore; Gender gaps are defined as the mean scale score of girls minus the mean scale score of boys (F-M). Standard errors for the gender gap estimates were 
calculated as SEp_f^ = -J{SE/ + SE^/). See appendix A for details. 

Source; Authors' analysis based on NAEP 2000, 2003, 2005, and 2007 math data for grade 8 from U.S. Department of Education, National Center for Educa- 
tion Statistics (2008). 
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Source; Authors' analysis based on 2002, 2003, 2005, and 2007 NAEP reading data for grade 4 from U.S. Department of Education, National Center for Education Statistics (2008). 






TABLE E12 

Grade 8 National Assessment of Educational Progress average scaled reading scores and student poverty and disability gaps by student 
poverty and disability status in Vermont and the country, 2002, 2003, 2005, and 2007 
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TABLE El 3 

Grades 4 and 8 National Assessment of Educational Progress average scaled writing scores and poverty and 
disability gaps in Vermont and the country, 2002 



Student 

status 




Grade 4 






Grade 8 




Vermont 


National 


Vermont 


National 


Mean 


Standard 

error 


Mean 


Standard 

error 


Mean 


Standard 

error 


Mean 


Standard 

error 


Poverty status 


Students in 


142.6 


1.9 


140.5 


0.8 


143.7 


2.5 


135.9 


0.5 


poverty 


(34.3) 


1.3 


(34.6) 


0.6 


(39.7) 


1.4 


(36.2) 


0.3 


Students not 


163.2 


1.7 


163.0 


0.5 


168.2 


1.3 


161.2 


0.7 


in poverty 


(35.6) 


1.1 


(34.3) 


0.3 


(36.5) 


0.9 


(35.2) 


0.3 


Poverty gap 


20.6** 


2.6** 


22.5** 


0.9** 


24.5** 


2.8** 


25.3** 


0.9** 


Effect size (d) 


0.59 




0.65 




0.64 




0.71 




Us 


.72 




.74 




.74 




.76 




Disability status 


Students with 


118.3 


3.3 


120.9 


0.7 


126.8 


2.3 


111.5 


1.1 


disabilities 


(31.0) 


2.2 


(34.4) 


0.6 


(34.0) 


1.3 


(35.2) 


0.6 


Students without 


162.8 


1.4 


155.9 


0.5 


168.5 


1.3 


156.3 


0.6 


disabilities 


(34.3) 


0.8 


(34.8) 


0.4 


(36.0) 


1.0 


(35.1) 


0.3 


Disability gap 


44.5** 


3.6** 


35.0** 


0.9** 


41.7** 


2.6** 


44.8** 


1.3** 


Effect size (d) 


1.36 




1.01 




1.19 




1.27 




Us 


.91 




.84 




.88 




.90 





**The difference in average scores between students not in poverty and students in poverty, or between students without disabilities and students with 
disabilities, is statistically significant (different from zero) at p < 0.05. 

Note: Numbers in parentheses are standard deviations. Standard errors for the poverty gap (students not in poverty - students in poverty, or NP-P) were 
calculated as S£jvp-p = V(SEnp^ + SEp^). Standard errors for the disability gap (students without disabilities - students with disabilities, or WD-D) were calcu- 
lated as SEfiQ_o = V(Sf/vD^ + SEq^). Effect size is calculated as |d| = (M, - M 2 ) / where = V[(SD,^ -1- SDj^) / 2]. L/3 is the proportion of scores in 

the group with the lower mean that fall below the noted effect size in a standard normal distribution. See appendix A for details. 

Source: Authors' analysis based on 2002 NAEP writing data for grades 4 and 8 from U.S. Department of Education, National Center for Education Statistics (2008). 






TABLE E14 

Grade 4 National Assessment of Educational Progress average scaled math scores and poverty and disability gaps in Vermont and the country, 
2000, 2003, 2005, and 2007 
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Source: Authors' analysis based on 2000, 2003, 2005, and 2007 N AEP math data for grade 4 from U.S. Department of Education, National Center for Education Statistics (2008). 






TABLE E15 

Grade 8 National Assessment of Educational Progress average scaled math scores and poverty and disability gaps in Vermont and the country, 
2000, 2003, 2005, and 2007 
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GENDER GAPS IN ASSESSMENT OUTCOMES IN VERMONT AND THE UNITED STATES 



TABLE El 6 

Differences in Vermont and U.S. National Assessment of Educational Progress poverty and disability gaps in 
grades 4 and 8 reading, writing, and math scores, 2000, 2002, 2003, 2005, and 2007 



Content 


2000 or 2002 




2003 






2005 






2007 




area 


Vermont 


National 


Difference 


Vermont 


National 


Difference 


Vermont 


National 


Difference 


Vermont 


National 


Difference 


Reading® 


Grade 4 


Poverty gap 


20.1 


26.8 


-6.7** 


17.6 


27.9 


-10.3** 


23.3 


27.0 


-3.7 


22.8 


26.7 


-3.9** 


Standard 

error 


2.36 


0.81 


2.49 


1.53 


0.47 


1.60 


1.88 


0.34 


1.91 


1.74 


0.40 


1.78 


Disability 

gap 


31.3 


32.9 


-1.6 


26.6 


35.4 


-8.8** 


36.8 


30.6 


6.2** 


39.6 


32.6 


7.0** 


Standard 

error 


3.28 


0.95 


3.41 


2.79 


0.66 


2.87 


2.89 


0.58 


2.95 


2.43 


0.62 


2.51 


Grade 8 


Poverty gap 


19.4 


22.3 


-2.9 


20.3 


24.8 


-4.5** 


19.4 


23.2 


-3.8** 


17.3 


23.5 


-6.2** 


Standard 

error 


2.06 


0.70 


2.18 


1.80 


0.48 


1.86 


1.65 


0.33 


1.68 


1.67 


0.39 


1.71 


Disability 

gap 


28.9 


38.7 


-9.8** 


29.1 


41.1 


-12.0** 


38.9 


37.7 


1.2 


29.3 


38.3 


-9.0** 


Standard 

error 


2.62 


1.12 


2.85 


2.19 


0.66 


2.29 


1.88 


0.53 


1.96 


2.58 


0.58 


2.64 


Writing‘s 


Grade 4 


Poverty gap 


20.6 


22.5 


-1.9 


na 


na 


na 


na 


na 


na 


na 


na 


na 


Standard 

error 


2.55 


0.89 


2.71 


na 


na 


na 


na 


na 


na 


na 


na 


na 


Disability 

gap 


44.5 


35.0 


9.5 


na 


na 


na 


na 


na 


na 


na 


na 


na 


Standard 

error 


3.58 


0.89 


3.69 


na 


na 


na 


na 


na 


na 


na 


na 


na 


Grade 8 


Poverty gap 


24.5 


25.3 


-0.8 


na 


na 


na 


na 


na 


na 


na 


na 


na 


Standard 

error 


2.81 


0.91 


2.95 


na 


na 


na 


na 


na 


na 


na 


na 


na 


Disability 

gap 


41.7 


44.8 


-3.1 


na 


na 


na 


na 


na 


na 


na 


na 


na 


Standard 

error 


2.64 


1.25 


2.92 


na 


na 


na 


na 


na 


na 


na 


na 


na 


Math^ 


Grade 4 


Poverty 


21.9 


26.4 


-4.5 


18.9 


22.8 


-3.9** 


19.3 


22.3 


-3.0** 


17.4 


22.1 


-4.7** 


gap 


2.79 


1.51 


3.18 


1.46 


0.37 


1.51 


1.23 


0.26 


1.25 


1.16 


0.30 


1.19 


Disability 


17.0 


29.3 


-12.3** 


24.1 


22.3 


1.8 


22.4 


21.1 


1.3 


30.2 


21.2 


9.0** 


gap 


4.66 


2.44 


5.26 


1.99 


0.46 


2.05 


1.70 


0.44 


1.76 


1.52 


0.43 


1.58 
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TABLE E16 (CONTINUED) 

Differences in Vermont and U.S. National Assessment of Educational Progress poverty and disability gaps in 
grades 4 and 8 reading, writing, and math scores, 2000, 2002, 2003, 2005, and 2007 



Content 


2000 or 2002 




2003 






2005 






2007 




area 


Vermont 


National 


Difference 


Vermont 


National 


Difference 


Vermont 


National 


Difference 


Vermont 


National 


Difference 


Grade 8 


Poverty 


25.0 


30.1 


-5.1 


23.1 


28.4 


-5.3** 


21.1 


26.7 


-5.6** 


18.7 


26.0 


-7.3** 


gap 


3.27 


1.61 


3.65 


1.63 


0.46 


1.69 


1.69 


0.32 


1.72 


1.71 


0.44 


1.77 


Disability 


40.9 


46.5 


-5.6 


33.0 


38.6 


-5.6** 


36.0 


37.5 


-1.5 


35.0 


37.7 


-2.7 


gap 


5.97 


2.43 


6.44 


2.00 


0.65 


2.11 


2.00 


0.51 


2.06 


2.27 


0.73 


2.39 



**The difference in the Vermont and national poverty and disability gap estimates is statistically significant (different from zero) at p < 0.05. 

na is not applicable because the NAEP writing assessment was not conducted in 2003 or 2005 and data for 2007 were not available at the time of this study. 

a. Data are for 2002, 2003, 2005, and 2007. 

b. Data are for 2002. 

c. Data are for 2000, 2003, 2005, and 2007. 

Wore; The poverty gap is the mean scale score of students not in poverty minus the mean scale score of students in poverty (NP-P). The disability gap is 
the mean scale score of students without disabilities minus the mean scale score of students with disabilities (WD-D). Standard errors for the difference in 
Vermont and national poverty and disability gap estimates were calculated as 5EvTgap-u.s.gap = s gap^)- Seeappendix A for details. 

Source: Authors' analysis based on NAEP data from U.S. Department of Education, National Center for Education Statistics (2008). 
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APPENDIX F 

VERMONT NEW ENGLAND COMMON 
ASSESSMENT PROGRAM AND NATIONAL 
ASSESSMENT OF EDUCATIONAL PROGRESS 
SCALE SCORES AND EFFECT SIZES 

TABLE FI 

Grades 4 and 8 Vermont average scaled reading scores and effect sizes by gender and student poverty 
and disability status from the 2006 New England Common Assessment Program and the 2007 National 
Assessment of Educational Progress, 2006-07 







Grade 4 






Grade 8 






2006 NECAP 


2007 NAEP 


2006 NECAP 


2007 NAEP 


Students and 
outcome statistic 


Mean 


Standard 

error 


Mean 


Standard 

error 


Mean 


Standard 

error 


Mean 


Standard 

error 


Total 


Male students 


442.4 


0.23 


224.5 


1.30 


842.0 


0.22 


267.6 


1.27 


(12.3) 




(34.4) 




(13.3) 




(30.5) 




Female students 


446.5 


0.23 


232.1 


0.94 


847.2 


0.22 


278.2 


1.22 


(12.9) 




(33.2) 




(12.9) 




(29.3) 




Difference (percentile points) 


4.1** 


0.33** 


7.6** 


1.61** 


5.2** 


0.31** 


10.6** 


1.76** 


Effect size (d) 


0.32 




0.22 




0.40 




0.35 




Us 


.63 




.59 




.66 




.64 




Students in poverty 


Male students 


436.6 


0.44 


209.4 


2.34 


835.8 


0.42 


255.0 


2.23 


(14.2) 




(33.9) 




(13.3) 




(33.9) 




Female students 


441.2 


0.42 


215.6 


1.84 


840.9 


0.43 


266.0 


1.78 


(13.2) 




(31.3) 




(13.0) 




(31.3) 




Difference (percentile points) 


4.6** 


0.61** 


6.2** 


2.98** 


5.1** 


0.60** 


11.0** 


2.85** 


Effect size (d) 


0.34 




0.19 




0.39 




0.34 




Us 


.63 




.58 




.65 




.63 




Students not in poverty 


Male students 


445.0 


0.25 


231.3 


1.31 


844.2 


0.24 


272.4 


1.33 


(12.0) 




(32.4) 




(12.6) 




(32.4) 




Female students 


448.9 


0.26 


239.5 


1.19 


849.5 


0.24 


282.3 


1.44 


(12.1) 




(31.3) 




(12.0) 




(31.3) 




Difference (percentile points) 


3.9** 


0.36** 


8.2** 


1.77** 


5.3** 


0.34** 


9 9 ** 


1.96** 


Effect size (d) 


0.32 




0.26 




0.43 




0.31 




Us 


.63 




.60 




.67 




.62 
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TABLE FI (CONTINUED) 

Grades 4 and 8 Vermont average scaled reading scores and effect sizes by gender and student poverty 
and disability status from the 2006 New England Common Assessment Program and the 2007 National 
Assessment of Educational Progress, 2006-07 



Grade 4 


Grade 8 


2006 NECAP 2007 NAEP 


2006 NECAP 2007 NAEP 



Students and 




Standard 




Standard 




Standard 




Standard 


outcome statistic 


Mean 


error 


Mean 


error 


Mean 


error 


Mean 


error 



Students with disabilities 


Male students 


427.5 


0.67 


195.6 


2.93 


827.2 


0.45 


249.0 


2.80 


(15.1) 




(36.0) 




(11.7) 




(32.6) 




Female students 


425.7 


0.95 


189.9 


4.34 


827.9 


0.64 


246.6 


3.97 


(14.5) 




(41.8) 




(11.4) 




(35.0) 




Difference (percentile points) 


-1.8** 


1.16** 


-5.7 


5.23** 


0.7 


0.78** 


-2.4 


4.86** 


Effect size (d) 


-0.12 




-0.15 




0.06 




-0.07 




Us 


.45 




.44 




.52 




.47 




Students without disabilities 


Male students 


445.0 


0.21 


231.1 


1.21 


845.3 


0.20 


272.6 


1.21 


(11.0) 




(30.4) 




(11.3) 




(27.9) 




Female students 


448.2 


0.21 


235.7 


0.93 


849.1 


0.20 


281.6 


1.20 


(11.2) 




(29.7) 




(11.3) 




(26.5) 




Difference (percentile points) 


3.2** 


0.29** 


4.6** 


1.53** 


3.9** 


0.29** 


9.0** 


1.70** 


Effect size (d) 


0.28 




0.15 




0.34 




0.33 




Us 


.61 




.56 




.63 




.63 





**Gender gap is statistically significant (different from zero) at p < 0.05. 

Note: Numbers in parentheses are standard deviations. Effect size is calculated as |d| = M 2 ) / where = V[(SD,^ + SD2^)/2], is the 

proportion of scores in the group with the lower mean that fall below the noted effect size in a standard normal distribution. Seeappendix A for details. 

Source: Authors' analysis based on NECAP data from Vermont Department of Education, Standards and Assessment (2007) and NAEP data from U.S. Depart- 
ment of Education, National Center for Education Statistics (2008). 
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TABLE F2 



Grades 5 and 8 Vermont average scaled writing scores and effect sizes by gender and student poverty and 
disability status from the 2006 New England Common Assessment Program, 2006 



Students and 




Grade 5 




Grade 8 


outcome statistics 


Mean 


Standard error 


Mean 


Standard error 


Total 


Male students 


536.7 


0.27 


835.8 


0.23 


(15.3) 




(13.9) 




Female students 


543.3 


0.29 


844.5 


0.23 


(16.2) 




(13.8) 




Difference (percentile points) 


6.6** 


0.39** 


8.7** 


0.33** 


Effect size (d) 


0.42 




0.63 




Us 


.66 




.74 




Students in poverty 


Male students 


530.4 


0.44 


829.6 


0.41 


(14.4) 




(12.9) 




Female students 


536.7 


0.50 


838.2 


0.43 


(15.5) 




(13.2) 




Difference (percentile points) 


6.2** 


0.67** 


8.6** 


0.60** 


Effect size (d) 


0.42 




0.66 




Us 


.66 




.75 




Students not in poverty 


Male students 


539.5 


0.31 


838.0 


0.26 


(14.9) 




(13.5) 




Female students 


546.3 


0.34 


846.8 


0.26 


(15.6) 




(13.3) 




Difference (percentile points) 


6.7** 


0.46** 


8.8** 


0.37** 


Effect size (d) 


0.44 




0.66 




Us 


.67 




.75 




Students with disabilities 


Male students 


520.9 


0.56 


820.9 


0.44 


(12.4) 




(11.4) 




Female students 


521.3 


0.76 


826.6 


0.66 


(12.8) 




(11.6) 




Difference (percentile points) 


0.4 


0.94** 


5.7** 


0.79** 


Effect size (d) 


0.03 




0.49 




Us 


.51 




.69 
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TABLE F2 (CONTINUED) 

Grades 5 and 8 Vermont average scaled writing scores and effect sizes by gender and student poverty and 
disability status from the 2006 New England Common Assessment Program, 2006 



Students and 




Grade 5 




Grade 8 


outcome statistics 


Mean 


Standard error 


Mean 


Standard error 


Students without disabilities 


Male 


539.5 


0.27 


839.1 


0.22 


(14.1) 




(12.1) 




Female 


545.4 


0.28 


845.3 


0.23 


(14.9) 




(12.6) 




Difference 


6.0** 


0.38** 


6.2** 


0.31** 


Effect size (d) 


0.41 




0.50 




Us 


.66 




.69 





**The difference in average female scores and male scores (F-M) is statistically significant (different from zero) atp < 0.05. 

Note: Numbers in parentheses are standard deviations. Effect size is calculated as |d| = (M, -/W2) where SDpo„,jrf= V[(SD,^ + SD2^) / 2]. L/3 is the 

proportion of scores in the group with the lower mean that fall below the noted effect size in a standard normal distribution. Seeappendix A for details. 

Source: Authors' tabulations based on NECAP data from Vermont Department of Education, Standards and Assessment (2007) and NAEP data from U.S. 
Department of Education, National Center for Education Statistics (2008). 
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TABLE F3 

Grades 4 and 8 Vermont average scaled math scores and effect sizes by gender and student poverty 
and disability status from the 2006 New England Common Assessment Program and the 2007 National 
Assessment of Educational Progress, 2006-07 







Grade 4 






Grade 8 






2006 NECAP 


2007 NAEP 


2006 NECAP 


2007 NAEP 


Students and 




Standard 




Standard 




Standard 




Standard 


outcome statistics 


Mean 


error 


Mean 


error 


Mean 


error 


Mean 


error 


Total 


Male students 


443.3 


0.22 


247.7 


0.67 


840.9 


0.20 


291.9 


1.04 


(12.5) 




(27.2) 




(12.1) 




(35.2) 




Female students 


443.1 


0.22 


245.0 


0.72 


841.9 


0.19 


290.1 


1.06 


(12.3) 




(26.0) 




(11.0) 




(31.8) 




Difference (percentile points) 


0.2 


0.31** 


-2.7** 


0.98** 


0.9** 


0.27** 


1.8 


1.48** 


Effect size (d) 


0.02 




-0.10 




0.08 




0.05 




Us 


.51 




.46 




.53 




.52 




Students in poverty 


Male students 


437.7 


0.40 


235.9 


1.43 


835.9 


0.39 


280.5 


2.13 


(12.8) 




(27.1) 




(12.2) 




(32.7) 




Female students 


437.6 


0.40 


232.9 


1.17 


836.7 


0.36 


274.6 


1.79 


(12.3) 




(26.7) 




(11.0) 




(29.2) 




Difference (percentile points) 


0.0 


0.56** 


-3.0 


1.85** 


1.8 


0.53** 


-5.9** 


2.78** 


Effect size (d) 


0.00 




-0.11 




0.11 




-0.19 




Us 


.50 




.46 




.54 




.42 




Students not in poverty 


Male students 


445.9 


0.24 


253.0 


0.82 


842.7 


0.22 


295.9 


1.35 


(11.5) 




(25.5) 




(11.6) 




(35.2) 




Female students 


445.5 


0.25 


250.5 


0.91 


843.8 


0.21 


296.4 


1.17 


(11.6) 




(23.7) 




(10.4) 




(30.7) 




Difference (percentile points) 


-0.3 


0.34** 


-2.5 


1.22** 


1.1** 


0.30** 


0.5 


1.79** 


Effect size (d) 


-0.03 




-0.10 




0.10 




0.02 




Us 


.49 




.46 




.54 




.51 




Students with disabilities 


Male students 


431.4 


0.62 


223.6 


1.79 


827.7 


0.47 


265.0 


2.89 


(13.9) 




(28.5) 




(12.3) 




(36.2) 




Female students 


426.0 


0.84 


215.3 


2.52 


826.2 


0.65 


255.1 


3.25 


(12.9) 




(28.6) 




(11.6) 




(29.1) 




Difference (percentile points) 


-5.4** 


1.05** 


-8.3** 


3.09** 


-1.5 


0.81** 


-9.9** 


4.35** 


Effect size (d) 


-0.41 




-0.29 




-0.13 




-0.30 




Us 


.34 




.39 




.45 




.38 
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TABLE F3 (CONTINUED) 

Grades 4 and 8 Vermont average scaled math scores and effect sizes by gender and student poverty 
and disability status from the 2006 New England Common Assessment Program and the 2007 National 
Assessment of Educational Progress, 2006-07 



Grade 4 


Grade 8 


2006 NECAP 2007 NAEP 


2006 NECAP 2007 NAEP 



Students and 




Standard 




Standard 




Standard 




Standard 


outcome statistics 


Mean 


error 


Mean 


error 


Mean 


error 


Mean 


error 



Students without disabilities 


Male students 


445.4 


0.21 


253.1 


0.77 


843.9 


0.18 


298.8 


1.06 


(10.9) 




(23.8) 




(9.9) 




(31.5) 




Female students 


444.5 


0.21 


248.6 


0.76 


843.4 


0.17 


294.4 


0.97 


(11.2) 




(23.2) 




(9.7) 




(29.4) 




Difference (percentile points) 


-1.0** 


0.29** 


-4.5** 


1.08** 


-0.4 


0.25** 


-4,4** 


1.44** 


Effect size (d) 


-0.09 




-0.19 




-0.05 




-0.14 




Us 


.46 




.42 




.48 




.44 





**The difference in average female scores and male scores (F-M) is statistically significant (different from zero) atp < 0.05. 

Note: Numbers in parentheses are standard deviations. Effect size is calculated as |d| = (M, -/W2) where SDpo„,jrf= V[(SD,^ + SDj^) /2]. L/3 is the 

proportion of scores in the group with the lower mean that fall below the noted effect size in a standard normal distribution. See appendix A for details. 

Source; Authors' analysis of NECAP data from Vermont Department of Education, Standards and Assessment (2007) and NAEP data from U.S. Department of 
Education, National Center for Education Statistics (2008). 
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TABLE F4 

Grades 4 and 8 Vermont average scaled reading scores and effect sizes by poverty status and disability 
status from the 2006 New England Common Assessment Program and the 2007 National Assessment of 
Educational Progress, 2006-07 



Student status and 
outcome statistics 




Grade 4 






Grade 8 




2006 NECAP 


2007 NAEP 


2006 NECAP 


2007 NAEP 


Mean 


Standard 

error 


Mean 


Standard 

error 


Mean 


Standard 

error 


Mean 


Standard 

error 


Poverty status 


Students 


438.8 


0.31 


212.5 


1.48 


838.3 


0.31 


260.3 


1.39 


in poverty 


(13.9) 




(32.8) 




(13.4) 




(30.6) 




Students not 


446.9 


0.18 


235.3 


0.92 


846.7 


0.17 


277.6 


0.92 


in poverty 


(12.2) 




(32.1) 




(12.6) 




(29.0) 




Difference 


8.0** 


0.36** 


22.8** 


1.74** 


8.5** 


0.35** 


17.3** 


1.67** 


(percentile points) 


















Effect size (d) 


0.61 




0.70 




0.65 




0.58 




Us 


.73 




.76 




.74 




.72 




Disability status 


Students with 


426.9 


0.55 


193.9 


2.30 


824.5 


0.37 


248.2 


2.44 


disabilities 


(14.9) 




(37.9) 




(11.6) 




(33.4) 




Students without 


446.6 


0.15 


233.5 


0.80 


847.2 


0.15 


277.5 


0.83 


disabilities 


(11.3) 




(30.1) 




(11.4) 




(27.5) 




Difference 


19.7** 


0.57** 


39.6** 


2.43** 


22.7** 


0.39** 


29.3** 


2.58** 


Effect size (d) 


1.49 




1.16 




1.97 




0.96 




Us 


.93 




.88 




.98 




.83 





**The difference in average scores between students not in poverty and in poverty (NP - P), or students without disabilities and with disabilities (WD - D), is 
statistically significant (different from zero) atp < 0.05. 

Note: Numbers in parentheses are standard deviations. Effect size is calculated as |d| = (M, -/M 2 ) where SDpo„, 5 rf= V[(SD,^ + SD 2 ^) / 2]. L /3 is the 

proportion of scores in the group with the lower mean that fall below the noted effect size in a standard normal distribution. Seeappendix A for details. 

Source: Authors' analysis of NECAP data from Vermont Department of Education, Standards and Assessment (2007) and NAEP data from U.S. Department of 
Education, National Center for Education Statistics (2008). 
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TABLE F5 

Grades 5 and 8 Vermont average scaled writing scores and effect sizes by poverty status and disability status 
from the 2006 New England Common Assessment Program, 2006 







Grade 5 




Grade 8 


Student status 


Mean 


Standard error 


Mean 


Standard error 


Poverty status 


Students in poverty 


533.4 


0.34 


833.8 


0.32 


(15.3) 




(13.8) 




Students not in poverty 


542.8 


0.23 


842.2 


0.19 


(15.6) 




(14.1) 




Difference (percentile points) 


9.4** 


0.41** 


8.5** 


0.37** 


Effect size (d) 


0.61 




0.61 




Us 


.73 




.73 




Disability status 


Students with disabilities 


521.0 


0.45 


822.7 


0.37 


(12.6) 




(11.8) 




Students without disabilities 


542.5 


0.20 


842.8 


0.16 


(14.8) 




(12.9) 




Difference (percentile points) 


21.5** 


0.49** 


20.1** 


0.41** 


Effect size (d) 


1.57 




1.62 




Us 


.94 




.95 





**The difference in average scores between students not in poverty and in poverty (NP - P), or students without disabilities and with disabilities [ND - D), is 
statistically significant (different from zero) atp < 0.05. 

Note: Numbers in parentheses are standard deviations. Effect size is calculated as |d| = (M, - M 2 ) / where SDp^^i^^= V[(SD,^ + SD2^)/2]. (fj is the 

proportion of scores in the group with the lower mean that fall below the noted effect size in a standard normal distribution. See appendix A for details. 

Source: Authors' analysis of NECAP data from Vermont Department of Education, Standards and Assessment (2007) and NAEP data from U.S. Department of 
Education, National Center for Education Statistics (2008). 
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TABLE F6 

Grades 4 and 8 Vermont average scaled math scores and effect sizes by poverty and disability status from 
the 2006 New England Common Assessment Program and the 2007 National Assessment of Educational 
Progress, 2006-07 







Grade 4 






Grade 8 






2006 NECAP 


2007 NAEP 


2006 NECAP 


2007 NAEP 


Student 

status 


Mean 


Standard 

error 


Mean 


Standard 

error 


Mean 


Standard 

error 


Mean 


Standard 

error 


Poverty status 


Students in poverty 


437.6 


0.28 


234.4 


0.97 


836.3 


0.27 


277.4 


1.45 


(12.5) 




(27.0) 




(11.7) 




(31.0) 




Students not in poverty 


445.7 


0.17 


251.8 


0.63 


843.2 


0.15 


296.1 


0.91 


(11.5) 




(24.7) 




(11.0) 




(33.1) 




Difference (percentile difference) 


8.1** 


0.33** 


17.4** 


1.16** 


6.9** 


0.31** 


18.7** 


1.71** 


Effect size (d) 


0.67 




0.67 




0.61 




0.58 




Us 


.75 




.75 




.73 




.72 




Disability status 


Students with disabilities 


429.7 


0.51 


220.6 


1.40 


827.7 


0.12 


261.5 


2.15 


(13.8) 




(28.8) 




(12.1) 




(34.2) 




Students without disabilities 


444.9 


0.15 


250.8 


0.59 


843.6 


0.38 


296.5 


0.74 


(11.1) 




(23.6) 




(9.8) 




(30.5) 




Difference (percentile points) 


15.2** 


0.53** 


30.2** 


1.52** 


15.9** 


0.40** 


35.0** 


2.27** 


Effect size (d) 


1.22 




1.15 




1.45 




1.08 




Us 


.89 




.87 




.93 




.86 





**The difference in average scores between students not in poverty and in poverty (NP - P), or students without disabilities and with disabilities [ND - D), is 
statistically significant (different from zero) atp < 0.05. 

Note: Numbers in parentheses are standard deviations. Effect size is calculated as |d| = (M, -/W 2 ) where SDpo„,jrf= V[(SD,^ + SD 2 ^) / 2]. L /3 is the 

proportion of scores in the group with the lower mean that fall below the noted effect size in a standard normal distribution. Seeappendix A for details. 

Source: Authors' tabulations of NECAP data from Vermont Department of Education, Standards and Assessment (2007) and NAEP data from U.S. Department 
of Education, National Center for Education Statistics (2008). 
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NOTES 

1 . 



2 . 



3. 



of their low proportions within the state (see 
table Cl in appendix C). 



The authors thank Jane Nesbitt and the 
Vermont Department of Education for as- 
sistance with this project. The authors also 
acknowledge the contributions of Craig Hoyle, 
Katie Buckley, Karen Clay, Denise Lamb, Jane 
Donnelly, Rebecca Carey, Maria-Paz Avery, 
Natalie Lacireno-Paquet, Sarah Chace, Katie 
Culp, Jessica Brett, Bailey Triggs, and Michelle 
LaPointe. 

Gender differences in aggregate math scores 
have been relatively small in recent years. 
However, due to the greater variability of male 
scores and the underrepresentation of female 
students in the upper tail of the math score 
distribution, the gender gap in math has been 
larger among high- achieving students bound 
for higher education (Lewis and Willing- 
ham 1995). These larger gender gaps in math 
among populations that aim for postsecond- 
ary education may help to explain the existing 
underrepresentation of women in science, 
technology, engineering, and math (National 
Science foundation 2006). 

Lor decades scholars have debated the origin 
of gender gaps in academic and other out- 
comes. The most widely accepted current view 
is that gender differences arise from complex 
interactions between biology and the envi- 
ronment (Halpern 2000, 2004; Kimura 1999, 
2004; Neisser et al. 1996). 

Although national data show that gender gaps 
may vary in important ways by race/ethnicity 
and limited English proficiency status, Ver- 
mont education leaders chose not to examine 
gender gaps within these subgroups because 



5. The NAEP is administered to representa- 
tive samples of grade 4 and 8 students across 
Vermont. Because the NAEP does not assess 
all students, it cannot be used to measure 
adequate yearly progress in reading and math 
as required by the act (appendix B). 

6. Samples for the Vermont and U.S. NAEP 
assessments each include more than 1,000 
students and are therefore large. See NAEP 
Technical Documentation at http://nces. 
ed.gov/nationsreportcard/tdw/sample_design/. 
Although the Vermont sample forms a subset 
of the larger national sample, the Vermont and 
U.S. NAEP samples were treated as indepen- 
dent because the Vermont fraction of the 
national sample is likely to be extremely small. 
The NAEP does not report the proportion of 
Vermont students within the total national 
sample, but according to 2005/06 data from the 
U.S. Department of Education, National Center 
for Education Statistics (2006a), Vermont had a 
total enrollment of 96,638 students— less than 
2 percent of the national total (49.1 million). 
The proportion of Vermont students within the 
U.S. NAEP sample should therefore also be less 
than 2 percent. Any correlation between NAEP 
scores in Vermont and the United States is 
therefore likely to be trivial. 

7. The Vermont NAEP reading and writing 
assessments were first administered in 2002. 
The state’s NAEP math assessment was first 
administered in Vermont in 1996, but it dif- 
fered from the one administered after 2000. 
This project therefore focused on NAEP math 
data from 2000 to 2007. 
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